Abstract

We extend the notion of Gibbsianness for mean-field systems to the set-up of
general (possibly continuous) local state spaces. We investigate the Gibbs properties
of systems arising from an initial mean-field Gibbs measure by application of given
local transition kernels. This generalizes previous case studies made for spins taking
finitely many values and is a first step towards a general theory, containing the
following parts: (1) A formula for the limiting conditional probability distributions
of the transformed system. It holds both in the Gibbs and non-Gibbs regime and
invokes a minimization problem for a "constrained rate-function". (2) A criterion
for Gibbsianness of the transformed system for initial Lipschitz-Hamiltonians
involving concentration properties of the transition kernels. (3) A continuity estimate
for the single-site conditional distributions of the transformed system. While (2)
and (3) have provable lattice-counterparts, the characterization of (1) is stronger
in mean-field. As applications we show short-time Gibbsianness of rotator mean-field
models on the $(q-1)$-dimensional sphere under diffusive time-evolution and
the preservation of Gibbsianness under local coarse-graining of the initial local spin
space.

1 Introduction



The study of the (failure of the) Gibbs property is a source of interesting probability
theory and is linked to the study of phase-transitions. Gibbs measures and generalized
Gibbs measures are of interest not only on the lattice, but on more general structures.
Examples of such structures are random graphs, or, in the simplest conceivable case, 
the complete graph, where the models are called mean-field models. 

The Gibbs property of a given measure should be viewed as a continuity property 
of conditional probabilities as a function of the conditioning. When one tries to prove 
or disprove this property for a measure obtained by an application of a deterministic 
or stochastic transformation from a well-understood initial measure one is led to a 
constrained (or "quenched") problem, with "quenched impurities" that are induced by 
the conditioning. This introduces a "random" (or in better words constrained) system 
that we need to understand [10], and this ties the problem to disordered systems and 
statistical mechanics on random structures. 

It was through rigorous implementation of Renormalization Group transformations 
that it was discovered that images of Gibbs measures can be non-Gibbs [1011181 [T9l I14j . 
Since this discovery, there has been particular recent interest in the study
of the loss and possible recovery of the Gibbs property of an initial Gibbs measure
under a stochastic time-evolution. The study started in p] where the authors focused 
on the evolution of an initial Gibbs measure of a lattice spin Ising model under
high-temperature spin-flip Glauber dynamics. The main phenomenon observed here was
the loss of the Gibbs property after a certain transition time when the system was 
started from an initial low temperature state. The measure stays non-Gibbs forever 
when the initial external field is zero. More complicated transitions between Gibbs and 
non-Gibbs are possible at intermediate times when there is no spin-flip symmetry. The 
case of site-wise independent diffusions of continuous spins on the lattice starting from 
the Gibbs-measure of a special double- well potential was considered in ^6], exhibiting 
similarities and differences to the Ising case. In [11] the authors studied models for 
continuous compact spins, namely the planar rotor models on the circle subjected to 
diffusive time-evolution. It is shown therein that, starting with an initial low-temperature
Gibbs measure, the time-evolved measure obtained for infinite- or high-temperature
dynamics stays Gibbs for short times, and that for an initial infinite- or high-temperature
Gibbs measure under infinite- or high-temperature dynamics the time-evolved
measure stays Gibbs forever. Their analysis uses the machinery of cluster
expansions, as earlier developed in [3]. Even earlier, it was shown that the whole process
of space-time histories can be viewed as a Gibbs measure [5], which however does not
imply that fixed-time projections are Gibbs.

Let us move from concrete examples to the elements of a general theory which 
have been proved so far. In [23] the preservation of the Gibbs property for compact 
(discrete and continuous) spin models for general initial interactions (having a finite
"triple-norm") subjected to general site-wise transformations is studied. The technique
employed therein is Dobrushin uniqueness [H |T7] , which is quite robust and gives rise to 
explicit estimates. We obtained both quantitative estimates on the parameter regimes 
where Gibbsianness provably holds and, as the main new part, explicit continuity esti- 
mates for the conditional probabilities of the transformed system. 

As an example it is shown therein that starting with an initial Gibbs measure of a 
rotator spin model on the $(q-1)$-dimensional sphere ($q \ge 2$) and performing site-wise
independent diffusive time-evolutions, the Gibbs property is preserved in an explicitly
computable time-interval starting from zero. Similar conclusions were drawn for Gibbs 
measures for general initial interactions (with compact metric local spin-spaces) sub- 
jected to a local coarse-graining transformation. (Given a decomposition of the local 
state-space S into countably many sets, the corresponding local coarse-graining is the 
map that associates to any point in S the label of the corresponding set in the decompo- 
sition.) Here the Gibbs property is preserved whenever the diameter of the largest set in 
the decomposition is small enough. Roughly speaking, this result can be seen as stability
of Gibbsianness under application of a ball of sufficiently fine local transformations of 
coarse-graining type. 

In a related line of research, transforms of initial Gibbs measures for various mean- 
field models were investigated. A variety of measures has been found to be non-Gibbs 
[25], [23], [20] in the mean-field sense. Usually the analysis of such systems shows parallels
to what can be done on the lattice but goes much further. We remark that in all the 
cases studied so far, mean-field spins that take finitely many values had been considered, 
and a unifying treatment including discrete and continuous spins had been lacking. For 
state of the art reviews on Gibbsianness and non-Gibbsianness we refer the reader to 

Now, in this note we present a systematic investigation of the Gibbs property of 
mean-field measures subjected to local kernels. We are out to extend previous results 
on spins taking finitely many values to general possibly continuous (but compact) spins. 
More mathematical care is needed since we consider distributions of empirical measures 
taking values in an infinite-dimensional space. So, let us provide an informal roadmap 
of the present paper now, leaving the precise definitions and statements of the theorems 
to the main body. 

What are the initial measures we are dealing with? We start in Section 2 by defining
a class of interactions $\Phi(\nu)$ as functions on empirical measures $\nu$ of the system. The
corresponding mean-field Hamiltonian in a volume of size $N$ is $N\Phi(\nu)$. The densities
of the finite-volume Gibbs measures w.r.t. an a priori product measure $\alpha$ in the volume
are given in terms of the normalized exponential $\exp(-N\Phi(\nu))$.

The first decision to be made is to find an appropriate notion of regularity of allowed
interactions $\Phi$. It turns out here that the natural requirement (the suitable mean-
field analogue of the standard notion of absolute summability for interactions on the 
lattice) is that of continuous differentiability (in the space of measures on the single-site 
configuration space). 

Next, we define the notion of mean-field Gibbsianness of a model which is given in
terms of the sequence of its finite-volume measures, by looking at large-volume limits
of the single-site conditional distributions obtained from this sequence. This procedure provides us
with a kernel $\gamma_1(d\sigma_1|\nu)$ where $\nu$ is the empirical measure of a configuration in the
conditioning. The model is called Gibbs if every $\nu$ is a continuity point of $\gamma_1$ (for the
weak topology). This is a natural generalization from the discrete spin examples where
this notion had been introduced and investigated before. From this definition it is also
apparent that the regularity requirement on the interaction made above is natural since
it implies Gibbsianness of the initial system (see more on this below Definition 3.5).

The situation is easier (and thus amenable to analysis) for mean-field models than 
for lattice models, since a configuration in the conditioning is replaced by a measure v on 
the local spin space, and it is just one single-site kernel that captures the large-volume 
behavior.

In Section 3 we turn to the main focus of the paper, namely two-layer models, obtained
by applying a stochastic kernel, independently over the sites, to the initial model.
A complete analysis of the Ising model in mean-field under stochastic site-wise indepen- 
dent time-evolution has been given in [25], showing the emergence of non-Gibbsianness 
at sharp critical times and a phenomenon called symmetry-breaking in the set of bad 
configurations. More examples are found in [23], [20]. At first we develop the general
theory which relates our desired object, the large-system limiting conditional distribution
of the transformed system, to a variational problem. In this part no specific assump- 
tions (other than continuous differentiability of the initial potential) will be made on 
the model. The results hold in regions of the parameter space of the interaction where 
both Gibbsianness and non-Gibbsianness can occur. In the non-Gibbsian regime how- 
ever we have to stay away from the specific critical values of the conditionings for which 
non-unique global minimizers occur. For the convenience of the reader we briefly re- 
view some background material on large deviations we will use for our analysis. Large 
deviation principles are interesting in themselves, but from the point of view of this 
paper, they will just be used as a tool to treat the limiting conditional probabilities. 
The main general result of this first general part is Theorem 3.11 which describes the
infinite-volume second-layer conditional probabilities in terms of a solution of a varia- 
tional problem (leading to a consistency equation) for the constrained first layer model 
(CFLM). 

In Section 4 we provide criteria for Gibbsianness of the transformed model. This
part is based on the study of the constrained consistency equation obtained in the
first part of the paper. By Tychonov's theorem there exists at least one solution. By
the contraction mapping theorem there is precisely one solution, provided the respective
kernel is Lipschitz, uniformly in the conditioning, with a constant $L$ which can be derived
explicitly and satisfies $L < 1$. Uniqueness of the solution implies mean-field Gibbsianness of the
transformed model, by the first part. This is in nice analogy to the corresponding lattice
results obtained in the paper [24] using techniques based on Dobrushin uniqueness. More
can be said however about the transformed system, and can be put in perspective with 
corresponding lattice results. 

In [23] we proved Gibbsianness, but we did more than that: we provided
explicit continuity estimates of the form

$$ \| \gamma'_i(d\eta_i|\eta_{i^c}) - \gamma'_i(d\eta_i|\eta'_{i^c}) \| \le \sum_{j \ne i} Q_{ij}\, d'(\eta_j, \eta'_j), $$

where $\gamma'_i(d\eta_i|\eta_{i^c})$ are single-site conditional probabilities for the transformed system,
$Q_{ij}$ is the so-called goodness matrix, and $d'$ is the so-called posterior metric. The
posterior metric is the variational distance between constrained single-site measures,
$d'(\eta_i, \eta'_i) = \| K(d\sigma_i|\eta_i) - K(d\sigma_i|\eta'_i) \|$, where $K$ is the joint single-site a priori measure
(obtained in terms of $\alpha$ and the transformation kernel).

In the present mean-field setup we prove as the main result of the second part of
the paper an estimate of the form

$$ \| \gamma'_1(d\eta_1|\nu') - \gamma'_1(d\eta_1|\bar\nu') \| \le L_2\, \| \nu' - \bar\nu' \| $$

with $L_2$ given in Theorem 4.3. In the lattice estimate there is a matrix $Q$ appearing,
describing the spatial decay of the influence of a variation of the conditioning at site $j$, while
in the mean-field estimate we are simply considering the variational distance of the
empirical measures of the conditioning.

$L_2$ will be finite for an initial interaction that is arbitrarily large but Lipschitz when
the constrained single-site measures have good concentration properties. This is the
case e.g. at short times for diffusive time-evolutions, or for sufficiently fine local coarse-
grainings. When the initial interaction is small the transformation plays no role, and
$L_2$ is always finite.

We conclude the paper with the discussion of stochastic time-evolutions and local
coarse-grainings in Section 5.



2 Generalities on Mean-Field Models 
2.1 Set-up 

Let $(S, d)$ and $(S', d')$ be two given compact Polish spaces (compact separable metric
spaces), each equipped with its Borel $\sigma$-algebra. We denote by $\mathcal{P}(S)$,
$\mathcal{M}_+(S)$ and $\mathcal{M}(S)$ ($\mathcal{P}(S')$, $\mathcal{M}_+(S')$ and $\mathcal{M}(S')$) the spaces of probability measures,
finite positive measures and finite signed measures on $S$ ($S'$) respectively. Let $\alpha$ and
$\alpha'$ be two given reference Borel probability measures (also called the a priori measures)
on $S$ and $S'$ respectively. In the following we will refer to $S$ as the initial (first-layer)
single-site spin space and to $S'$ as the transformed (second-layer) single-site
spin space. We write $\Omega = S^{\mathbb{N}}$ and $\Omega' = S'^{\mathbb{N}}$ for the configuration spaces
of the initial (first-layer) and the transformed (second-layer) systems respectively. In the sequel we
will write probability measures for the transformed system with primes and those for
the joint system (comprising the initial and transformed systems) with tildes. The
probability measures for the initial system will always be written without primes and
tildes. We denote by $\sigma$, $\eta$ and $\xi$ the spin variables for the initial, the transformed
and the joint systems respectively (e.g. $\xi = (\sigma_i, \eta_i)_{i \in \mathbb{N}} \in \tilde\Omega = (S \times S')^{\mathbb{N}}$). We further
set $V_N = \{1, \ldots, N\}$ and write $\sigma_{V_N}$ for points in the product space $S^N$. We will simply
write $\sigma$ instead of $\sigma_{\mathbb{N}}$. We now define the following concept of mean-field interaction for
the initial systems that we shall consider in this work.

Definition 2.1 We shall refer to a map $\Phi : \mathcal{M}_+(S) \to \mathbb{R}$ as a proper mean-field
interaction if it satisfies the following conditions:

1. it is weakly continuous and

2. it satisfies the uniform directional differentiability condition, meaning that, for
each $\nu \in \mathcal{M}_+(S)$ the derivative $\Phi^{(1)}(\nu, \mu)$ at $\nu$ in direction $\mu$ exists and we have

$$ \Phi(\nu + t\mu) - \Phi(\nu) - t\,\Phi^{(1)}(\nu, \mu) = r(t) \tag{1} $$

with $\lim_{t \to 0^+} r(t)/t = 0$ uniformly in $\mu \in \mathcal{M}(S)$ for which $\nu + t\mu \in \mathcal{M}_+(S)$, for
$t \in (0,1]$.

For each mean-field interaction $\Phi$ and each $N \in \mathbb{N}$ we define the finite-volume
Hamiltonian $H_N$ (a real-valued function on the product space $S^N$) as

$$ H_N(\sigma_{V_N}) = N\,\Phi\big(L_N(\sigma_{V_N})\big), \tag{2} $$

where $L_N(\sigma_{V_N}) = \frac{1}{N}\sum_{i=1}^N \delta_{\sigma_i}$ is the empirical measure. Observe from the permutation
invariance of the empirical measures that $H_N$ is also permutation invariant. With
this notation we define the finite-volume Gibbs measure $\mu_{\beta,N}$ for the finite-volume
Hamiltonian $H_N$ at inverse temperature $\beta$ as

$$ \mu_{\beta,N}(d\sigma_{V_N}) = \frac{\exp\big(-\beta H_N(\sigma_{V_N})\big)\, \bigotimes_{i=1}^N \alpha(d\sigma_i)}{\int_{S^N} \exp\big(-\beta H_N(\bar\sigma_{V_N})\big)\, \bigotimes_{i=1}^N \alpha(d\bar\sigma_i)}, \tag{3} $$

where we have used $\otimes$ to denote the tensor product of measures. In the following, unless
otherwise stated, the inverse temperature $\beta$ will be absorbed into the interaction $\Phi$.
In view of this, we will write $\mu_N$ instead of $\mu_{\beta,N}$. It follows from the permutation
invariance of the $H_N$'s and de Finetti's theorem that $\mu_N$ has weak infinite-volume
limits (Gibbs measures) which are convex combinations of product measures [13]. A
variational characterization of these infinite-volume measures and related results will
be the content of another paper which will appear elsewhere. In our current set-up we
will always assume these infinite-volume measures exist for the class of interactions we
consider.
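
To make the set-up concrete, the following minimal sketch enumerates a finite-volume Gibbs measure for a very small system. It assumes a finite local spin space and a quadratic interaction $\Phi(\nu) = -\frac{\beta}{2}\sum_a \nu(a)^2$ (a mean-field Potts-type choice made here purely for illustration, not taken from the text); all function names are hypothetical.

```python
import math
from collections import Counter
from itertools import product


def empirical_measure(sigma, states):
    """L_N(sigma): fraction of sites found in each local state."""
    counts = Counter(sigma)
    return [counts[s] / len(sigma) for s in states]


def phi(nu, beta):
    """Assumed interaction Phi(nu) = -(beta/2) * sum_a nu(a)^2 (illustrative choice)."""
    return -0.5 * beta * sum(p * p for p in nu)


def hamiltonian(sigma, states, beta):
    """H_N(sigma) = N * Phi(L_N(sigma)); beta is absorbed into Phi."""
    return len(sigma) * phi(empirical_measure(sigma, states), beta)


def gibbs_measure(n, states, alpha, beta):
    """Normalized weights exp(-H_N) against the product a priori measure."""
    weights = {}
    for sigma in product(states, repeat=n):
        prior = math.prod(alpha[s] for s in sigma)
        weights[sigma] = math.exp(-hamiltonian(sigma, states, beta)) * prior
    z = sum(weights.values())
    return {sigma: w / z for sigma, w in weights.items()}


states = (0, 1, 2)
alpha = {0: 0.5, 1: 0.25, 2: 0.25}   # a priori measure on the local spin space
mu = gibbs_measure(4, states, alpha, beta=2.0)
```

Because the Hamiltonian depends on a configuration only through its empirical measure, configurations that differ by a permutation of sites receive the same weight, which is the permutation invariance used in the text.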

2.2 Transforms of Mean-Field Models 

We now introduce on $S \times S'$ a Borel probability measure $K$ such that

$$ K(d\xi_i) = k(\sigma_i, \eta_i)\,\alpha(d\sigma_i)\,\alpha'(d\eta_i), \quad \text{with } \sup_{\sigma_i, \eta_i} |\log k(\sigma_i, \eta_i)| < \infty. \tag{4} $$

We assume further that $\alpha = \int k(\cdot, \eta_i)\,\alpha'(d\eta_i)$ and $\alpha' = \int k(\sigma_i, \cdot)\,\alpha(d\sigma_i)$, where we are
using the subscript $i \in \mathbb{N}$ to convey the idea that $K$ is the joint a priori measure for
site $i$. Given a Gibbs measure $\mu$ for the initial model (3), it is our aim in this work to
investigate the Gibbs properties of the transformed measures

$$ \mu'(d\eta) = \int_{\Omega} \mu(d\sigma) \prod_{i \in \mathbb{N}} k(\sigma_i, \eta_i)\,\alpha'(d\eta_i) \tag{5} $$

as has been done for the corresponding short-range models in [24]. The study, as
in [24], will be based on investigating the properties of the finite-volume conditional
distributions of the transformed system. This consists in studying the infinite-volume
$N$-limits of the following finite-volume quantities:

$$ \mu'_{N,n}(d\eta_{V_n}|\eta_{V_N \setminus V_n}) = \frac{\int_{S^N} \mu_N(d\sigma_{V_N}) \prod_{j=n+1}^N k(\sigma_j, \eta_j) \prod_{i=1}^n k(\sigma_i, \eta_i)\,\alpha'(d\eta_i)}{\int_{S^N} \mu_N(d\sigma_{V_N}) \prod_{j=n+1}^N k(\sigma_j, \eta_j)} \tag{6} $$

for fixed $n \in \mathbb{N}$ with $1 \le n < N$. But unlike in the lattice spin systems the boundary
conditions here will be fixed up to permutations, i.e., each boundary condition $\eta_{V_N \setminus V_n}$
will be a representative of a class of configurations which give rise to the same empirical
measure. In view of this, we shall take probability measures in $\mathcal{P}(S')$ as our boundary
conditions. As we shall show below, the infinite-volume $N$-limit of $\mu'_{N,n}$ will always
factorize and this factorization necessitates the study of the $n = 1$ case. This leads
to the following definition of Gibbsianness for mean-field models which was originally
introduced by one of the authors of this paper for the corresponding Curie-Weiss model
[25], [23]. The case studied here is a generalization of this notion from empirical averages
to empirical measures.

Definition 2.2 We call $\nu' \in \mathcal{P}(S')$ a good configuration if and only if

1. the limit

$$ \gamma'_1(d\eta_1|\lambda') := \lim_{N \uparrow \infty} \mu'_{N,1}(d\eta_1|\eta_{V_N \setminus \{1\}}), \quad \text{where } \lambda' = \lim_{N \uparrow \infty} \frac{1}{N-1} \sum_{i=2}^N \delta_{\eta_i}, \tag{7} $$

exists for all $\lambda'$ in a weak neighborhood of $\nu'$ and

2. for any Borel subset $A \subset S'$, the function $\lambda' \mapsto \gamma'_1(A|\lambda')$ is weakly continuous at
$\lambda' = \nu'$.

We say $\mu'$ is Gibbs iff every configuration is good.

In what follows (unless otherwise stated) continuity of maps on $\mathcal{P}(S')$ will always be
w.r.t. the weak topology.

In our investigation of the continuity properties of the single-site kernels for the 
transformed system we employ the machinery of large deviations theory. In view of 
this, we will recall some basic facts about large deviations theory that we will need in 
our analysis in the next subsection. 

2.3 Some facts about Large Deviations Theory 

In this subsection we recall some facts about large deviations theory; for a detailed
discussion of this theory and its application to statistical mechanics we refer the reader
to [3], [7]. Let $X$ be a Polish space equipped with its Borel $\sigma$-algebra.

Definition 2.3 A sequence of probability measures $(Q_N)_{N \in \mathbb{N}}$ in $\mathcal{P}(X)$ is said to satisfy
a large deviation principle (LDP) on $X$ with rate $a_N$ (a sequence of positive numbers
tending to infinity) and rate function $I : X \to [0, +\infty]$ if

1. $I$ is lower semi-continuous on $X$, and the level sets $\{x \in X : I(x) \le a\}$ are
compact for all $a \in [0, +\infty)$;

2. for any Borel subset $B$ of $X$,

$$ -I(B^\circ) \le \liminf_{N \to \infty} a_N^{-1} \log Q_N(B) \le \limsup_{N \to \infty} a_N^{-1} \log Q_N(B) \le -I(\bar{B}), \tag{8} $$

where for any subset $C$ of $X$, $I(C) = \inf_{x \in C} I(x)$, and $C^\circ$ and $\bar{C}$ are respectively
the interior and the closure of $C$.

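As an example take $(Y_n)_{n \in \mathbb{N}}$, an i.i.d. sequence of random variables on $X$ with $\rho$ as
the law of $Y_1$. Let $Q_N$ be the distribution of the empirical measures $L_N = \frac{1}{N}\sum_{i=1}^N \delta_{Y_i}$.
Then $Q_N$ satisfies an LDP with rate $N$ and rate function

$$ S(\nu|\rho) = \begin{cases} \int \frac{d\nu}{d\rho} \log \frac{d\nu}{d\rho}\, d\rho & \text{if } \nu \ll \rho \text{ and } \frac{d\nu}{d\rho} \log \frac{d\nu}{d\rho} \in L^1(\rho), \\ +\infty & \text{otherwise.} \end{cases} \tag{9} $$

The above example is Sanov's theorem in large deviations theory as can be found e.g.
in Theorem II.4.3 of [7].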
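
As a small numerical illustration of Sanov's theorem, the sketch below evaluates the relative entropy on a finite set and compares it with the exact exponential decay rate of a binomial probability. The finite state space, the fair-coin example and the function names are assumptions chosen for illustration.

```python
import math


def relative_entropy(nu, rho):
    """S(nu|rho) for probability vectors on a finite set (+inf if nu is not << rho)."""
    total = 0.0
    for p, q in zip(nu, rho):
        if p == 0.0:
            continue  # convention 0 * log 0 = 0
        if q == 0.0:
            return math.inf
        total += p * math.log(p / q)
    return total


def empirical_log_rate(n, k, rho_heads):
    """-(1/n) log P(exactly k heads in n flips) for a coin with P(heads) = rho_heads."""
    log_prob = (math.log(math.comb(n, k))
                + k * math.log(rho_heads)
                + (n - k) * math.log(1.0 - rho_heads))
    return -log_prob / n


# Rate predicted by Sanov's theorem for empirical measure (0.7, 0.3) under a fair coin
rate = relative_entropy([0.7, 0.3], [0.5, 0.5])
# Exact finite-N decay rate for N = 2000 flips with 1400 heads
finite_n = empirical_log_rate(2000, 1400, 0.5)
```

The finite-volume value lies slightly above the relative entropy, with a gap of order $(\log N)/N$, in line with the logarithmic scale on which an LDP is formulated.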
Another important fact about LDPs that we shall employ in our study is the contraction
principle (see e.g. Theorem II.5.1 of [7]), which comes into play when one is
concerned with a partial summary of the information weighted by $Q_N$. More precisely,
suppose $\psi$ is a continuous function from the Polish space $X$ to another Polish space $Y$
and $Q_N$ is a sequence of probability measures on $X$ satisfying the LDP with rate $a_N$ and
rate function $I$. Then the sequence $\tilde{Q}_N = Q_N \circ \psi^{-1}$ of probability measures on $Y$ also
satisfies the LDP with rate $a_N$ and rate function $\tilde{I}$ given by

$$ \tilde{I}(y) = \inf\{ I(x) : \psi(x) = y \}. \tag{10} $$

Our last fact from LDP theory concerns the integrals of exponentials of functionals of
random variables whose distributions satisfy an LDP. This is found e.g. in [7] as Theorem
II.7.2a. The result in [7] is more general than what is stated here.

Fact 2.4 Let $X$ be a Polish space and $Q_N$ a sequence of probability measures on $X$
obeying an LDP with rate $a_N$ and rate function $I$. Suppose that $F : X \to \mathbb{R}$ is
continuous and bounded below, and that for each $N \in \mathbb{N}$ the integral $\int_X \exp(-a_N F(x))\, Q_N(dx)$
is finite. Let $Q_{N,F}$ be the sequence of probability measures given by

$$ Q_{N,F}(A) = \frac{\int_A \exp(-a_N F(x))\, Q_N(dx)}{\int_X \exp(-a_N F(x))\, Q_N(dx)} \tag{11} $$

for any Borel subset $A$ of $X$. Then $Q_{N,F}$ satisfies an LDP with rate $a_N$ and rate function

$$ I_F(x) = I(x) + F(x) - \inf_{y \in X} \big( I(y) + F(y) \big). \tag{12} $$
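
The two operations on rate functions recalled above, contraction (10) and exponential tilting (12), can be sketched on a finite grid as follows. The grid, the toy rate function and the names `contract` and `tilt` are illustrative assumptions; on a finite set every infimum is a minimum.

```python
import math


def contract(rate, psi):
    """Contraction principle (10): image rate = min of the rate over each fibre of psi."""
    image = {}
    for x, value in rate.items():
        y = psi(x)
        image[y] = min(image.get(y, math.inf), value)
    return image


def tilt(rate, f):
    """Tilted rate function (12): I_F = I + F - inf(I + F)."""
    shifted = {x: value + f(x) for x, value in rate.items()}
    offset = min(shifted.values())
    return {x: value - offset for x, value in shifted.items()}


rate = {x: 0.5 * x * x for x in range(-3, 4)}   # toy rate function on a grid
folded = contract(rate, abs)                     # psi(x) = |x|
tilted = tilt(rate, lambda x: -2.0 * x)          # F(x) = -2x
```

In this toy example the tilt moves the zero of the rate function from $x = 0$ to $x = 2$, which is the mechanism by which an interaction shifts the typical empirical measure away from the a priori one.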

3 Two-layer Models and Gibbsianness of Transformed Systems

In this section we consider mean-field systems with $S \times S'$ as their single-site spin
space and $K$ as the corresponding a priori measure. As is discussed in [24], the a priori
measure $K$ couples two systems; namely, the first-layer system described by some given
mean-field interaction $\Phi$ and the a priori measure $\alpha$, and the second-layer system whose spins
are i.i.d. with distribution $\alpha'$. The finite-volume Gibbs measures $\tilde\mu_N$ for our two-layer
(joint) system are given by

$$ \begin{aligned} \tilde\mu_N(d\xi_{V_N}) &= \frac{\exp\big(-N\Phi(L_N(\sigma_{V_N}))\big) \prod_{i=1}^N K(d\sigma_i, d\eta_i)}{\int_{S^N} \exp\big(-N\Phi(L_N(\bar\sigma_{V_N}))\big) \prod_{i=1}^N \alpha(d\bar\sigma_i)} \\ &= \frac{\exp\big(-N\{\Phi(\pi_1 L_N(\xi_{V_N})) - L_N(\xi_{V_N})[\log k(\cdot,\cdot)]\}\big) \prod_{i=1}^N \alpha(d\sigma_i)\,\alpha'(d\eta_i)}{\int_{(S \times S')^N} \exp\big(-N\{\Phi(\pi_1 L_N(\bar\xi_{V_N})) - L_N(\bar\xi_{V_N})[\log k(\cdot,\cdot)]\}\big) \prod_{i=1}^N \alpha(d\bar\sigma_i)\,\alpha'(d\bar\eta_i)}, \end{aligned} \tag{13} $$

where $L_N(\xi_{V_N}) = \frac{1}{N}\sum_{i=1}^N \delta_{\xi_i}$ are the joint empirical measures and $\pi_1 L_N(\xi_{V_N}) =
L_N(\sigma_{V_N})$ is the projection onto the first variable. We have also denoted by $\nu[f]$ the
integral of the measurable map $f$ w.r.t. the measure $\nu$. Under the map $\xi_{V_N} \mapsto L_N(\xi_{V_N})$,
the joint measures $\tilde\mu_N$ have unique push-forwards $\tilde{Q}_{N,\Phi,k}$ in $\mathcal{P}(\mathcal{P}(S \times S'))$. So we now
study the large deviation properties of the empirical measures $L_N(\xi_{V_N})$ under $\tilde{Q}_{N,\Phi,k}$,
and the proposition below is a summary of this LDP property.






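Proposition 3.1 The sequence of probability measures $\tilde{Q}_{N,\Phi,k}$ satisfies an LDP with
rate $N$ and rate function $J$ given by

$$ J(\tilde\nu) = \Phi(\pi_1\tilde\nu) - \tilde\nu[\log k(\cdot,\cdot)] + S(\tilde\nu|\alpha \otimes \alpha') - \text{const}, \quad \text{where} \quad \text{const} = \inf_{\tilde\lambda \in \mathcal{P}(S \times S')} \Big\{ \Phi(\pi_1\tilde\lambda) - \tilde\lambda[\log k(\cdot,\cdot)] + S(\tilde\lambda|\alpha \otimes \alpha') \Big\}. \tag{14} $$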

Remark: 1. Proposition 3.1 is a direct consequence of Fact 2.4 since $\Phi$ is bounded
and the weak topology on $\mathcal{P}(S \times S')$ turns $\mathcal{P}(S \times S')$ into a compact separable metric
space.

2. The continuous map $\xi_{V_N} \mapsto L_N(\xi_{V_N})$ from $(S \times S')^N$ to $\mathcal{P}(S \times S')$ also induces a
continuous map (say $\vartheta_N$) from $\mathcal{P}((S \times S')^N)$ into $\mathcal{P}(\mathcal{P}(S \times S'))$. So the push-forwards
$\tilde{Q}_{N,\Phi,k}$ are simply obtained by replacing the product measures $\prod_{i=1}^N \alpha(d\sigma_i)\,\alpha'(d\eta_i)$ in the second
equality in (13) by their images under $\vartheta_N$.

3. Observe from Proposition 3.1 that the empirical measures of the initial spin variables
under the initial finite-volume Gibbs measures $\mu_N$ satisfy an LDP with rate $N$ and
rate function $I^{\Phi}$ given by

$$ I^{\Phi}(\nu) = S(\nu|\alpha) + \Phi(\nu) - \inf_{\mu \in \mathcal{P}(S)} \big[ S(\mu|\alpha) + \Phi(\mu) \big]. \tag{15} $$

Our next task is to study the LDP for the transformed measures $\mu'_N$ given by

$$ \mu'_N(d\eta_{V_N}) = \int_{S^N} \tilde\mu_N(d\sigma_{V_N}, d\eta_{V_N}). \tag{16} $$

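For any $\tilde\nu \in \mathcal{P}(S \times S')$ we denote by $\pi_2\tilde\nu$ the marginal of $\tilde\nu$ on $S'$. Denote by $Q'_{N,\Phi,k}$
the push-forward of $\mu'_N$ given by $Q'_{N,\Phi,k} = \tilde{Q}_{N,\Phi,k} \circ \pi_2^{-1}$. Therefore by the contraction
principle, the empirical measures of the transformed spins under $Q'_{N,\Phi,k}$ satisfy an LDP
which we formulate in the proposition below.

Proposition 3.2 The sequence of probability measures $Q'_{N,\Phi,k}$ in $\mathcal{P}(\mathcal{P}(S'))$ satisfies an
LDP with rate $N$ and rate function $J'$ given by

$$ J'(\nu') = \inf_{\tilde\nu \in \mathcal{P}(S \times S'):\, \pi_2\tilde\nu = \nu'} J(\tilde\nu) = S(\nu'|\alpha') + \inf_{\tilde\nu \in M_{\nu'}} J_{\nu'}(\tilde\nu) - \text{const}, \quad \text{where} \quad J_{\nu'}(\tilde\nu) = S(\tilde\nu|\alpha \otimes \nu') + \Phi(\pi_1\tilde\nu) - \tilde\nu[\log k(\cdot,\cdot)], \tag{17} $$

and $M_{\nu'}$ is the subset of $\mathcal{P}(S \times S')$ consisting of probability measures with fixed second
marginal $\nu'$.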

Remark: For each $\nu' \in \mathcal{P}(S')$, $J_{\nu'}$ up to an additive constant (depending on $\nu'$) is
the large deviations rate function for the joint system when the second-layer system
is constrained to configurations with empirical measure $\nu'$, i.e. $J_{\nu'}$ (up to an additive
constant) is the rate function for the CFLM. This CFLM rate function will play a key
role in determining whether the transformed system is Gibbs or not. We shall show
below that the continuity properties of the transformed single-site kernels $\gamma'_1(\cdot|\nu')$ will
be determined by the uniqueness of the global minimizers of the function $J_{\nu'}$ uniformly in
$\nu'$. Observe also that $J_{\nu'}$ is a lower semi-continuous function and that it attains its
infimum on $M_{\nu'}$, since $M_{\nu'}$ is a compact subset of $\mathcal{P}(S \times S')$. Additionally, $M_{\nu'}$ is convex.

Proof of Proposition 3.2: The first equality in the expression (17) for $J'$ in Proposition
3.2 follows from the contraction principle, since the map $\pi_2 : \mathcal{P}(S \times S') \to \mathcal{P}(S')$
is weakly continuous. Further, for each $\nu' \in \mathcal{P}(S')$ the set $M_{\nu'}$ is compact. Indeed,
$M_{\nu'} = \pi_2^{-1}(\{\nu'\})$ and by continuity of $\pi_2$ it is a closed subset of $\mathcal{P}(S \times S')$; the weak
topology turns $\mathcal{P}(S \times S')$ into a compact Polish space by the compactness of $S \times S'$;
and closed subsets of compact sets are compact. This shows that the infimum of $J$
over measures with fixed second marginal $\nu'$ is attained on $M_{\nu'}$.

Now for each $\nu' \in \mathcal{P}(S')$, the measures $\tilde\nu \in M_{\nu'}$ are of the form $\tilde\nu(d\xi_i) =
\nu'(d\eta_i)\,\tilde\nu(d\sigma_i|\eta_i)$, where for each $\eta_i \in S'$, $\tilde\nu(d\sigma_i|\eta_i) \in \mathcal{P}(S)$. With this representation,
the relative entropy for the elements in $M_{\nu'}$ w.r.t. $\alpha \otimes \alpha'$ takes the form

$$ S(\tilde\nu|\alpha \otimes \alpha') = S(\nu'|\alpha') + \int_{S'} \nu'(d\eta_i)\, S(\tilde\nu(\cdot|\eta_i)|\alpha) = S(\nu'|\alpha') + S(\tilde\nu|\alpha \otimes \nu'). \tag{18} $$

This proves the second equality of the expression for $J'$ in (17).

□
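
The entropy decomposition (18) can be checked numerically on finite spaces. The sketch below is only an illustration under the assumption of finite $S$ and $S'$, with arbitrary example numbers; joint measures are stored as matrices indexed by $(\sigma, \eta)$.

```python
import math


def relative_entropy(nu, rho):
    """S(nu|rho) for probability vectors on a finite set (+inf if nu is not << rho)."""
    total = 0.0
    for p, q in zip(nu, rho):
        if p == 0.0:
            continue
        if q == 0.0:
            return math.inf
        total += p * math.log(p / q)
    return total


def flatten(matrix):
    return [x for row in matrix for x in row]


def product_measure(first, second):
    """Product measure on S x S' as a matrix indexed by (sigma, eta)."""
    return [[x * y for y in second] for x in first]


def second_marginal(joint):
    return [sum(row[b] for row in joint) for b in range(len(joint[0]))]


joint = [[0.10, 0.25], [0.30, 0.05], [0.20, 0.10]]   # example joint measure, |S| = 3, |S'| = 2
alpha = [0.2, 0.3, 0.5]
alpha_prime = [0.5, 0.5]
nu_prime = second_marginal(joint)

lhs = relative_entropy(flatten(joint), flatten(product_measure(alpha, alpha_prime)))
rhs = (relative_entropy(nu_prime, alpha_prime)
       + relative_entropy(flatten(joint), flatten(product_measure(alpha, nu_prime))))
```

Both sides agree up to rounding, reflecting that the cost of the second marginal, $S(\nu'|\alpha')$, separates from the cost of the constrained first layer.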

The measures in $M_{\nu'}$ that are of interest in determining $J'(\nu')$ are at most those for
which $\tilde\nu(\cdot|\eta_i) \ll \alpha$ for all $\eta_i \in S'$, i.e.,

$$ \tilde\nu(d\xi_i) = \nu'(d\eta_i)\,\alpha(d\sigma_i)\, f_{\tilde\nu}(\sigma_i|\eta_i), \tag{19} $$

for some measurable function $f_{\tilde\nu} : S \times S' \to [0, \infty]$ with the property that for each
$\eta_i \in S'$, $\int \alpha(d\sigma_i)\, f_{\tilde\nu}(\sigma_i|\eta_i) = 1$. We call $f_{\tilde\nu}$ the conditional $\alpha$-density of $\tilde\nu$. This
reduces the whole problem of minimizing $J_{\nu'}$ over $M_{\nu'}$ to the problem of finding the
conditional $\alpha$-densities $f_{\tilde\nu}$ for which $\nu'(d\eta_i)\,\alpha(d\sigma_i)\, f_{\tilde\nu}(\sigma_i|\eta_i)$ is a minimizer of $J_{\nu'}$.

As our next result, we present an explicit expression for the conditional $\alpha$-densities
$f_{\tilde\nu}(\sigma_i|\eta_i)$ at which $J_{\nu'}$ attains both global and local minima.

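Theorem 3.3 For any $\nu' \in \mathcal{P}(S')$, the function $J_{\nu'}$ attains its infimum on $M_{\nu'}$. Furthermore,
any minimizer (global or local) $\tilde\nu \in M_{\nu'}$ of $J_{\nu'}$ has a conditional $\alpha$-density
$f_{\tilde\nu}$ which is $\alpha \otimes \nu'$-a.s. strictly positive and satisfies the "constrained mean-field equation"

$$ f_{\tilde\nu}(\sigma_i|\eta_i) = \frac{e^{-\Phi^{(1)}(\pi_1\tilde\nu, \delta_{\sigma_i})}\, k(\sigma_i, \eta_i)}{\int_S \alpha(d\bar\sigma_i)\, e^{-\Phi^{(1)}(\pi_1\tilde\nu, \delta_{\bar\sigma_i})}\, k(\bar\sigma_i, \eta_i)}. \tag{20} $$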

Remark: 1. As we pointed out above, the measures $\tilde\nu \in M_{\nu'}$ that are involved
in determining $J'(\nu')$ are those that take the form (19). The minimizers of $J_{\nu'}$ are
among these probability measures and indeed they are those probability measures with
$f_{\tilde\nu}$ given by

2. Note that the minimizers of the rate function (15) for the initial system are also
measures $\nu \in \mathcal{P}(S)$ with $\nu(d\sigma_i) = f(\sigma_i)\,\alpha(d\sigma_i)$, where the $f$'s are $\alpha$-a.s. strictly positive
and satisfy the mean-field equation

$$ f(\sigma_i) = \frac{e^{-\Phi^{(1)}(\nu, \delta_{\sigma_i})}}{\int_S e^{-\Phi^{(1)}(\nu, \delta_{\bar\sigma_i})}\, \alpha(d\bar\sigma_i)}. \tag{22} $$

We defer the proof of Theorem 3.3 to the Appendix at the end of the paper.

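The above consideration leads to the following definition of a probability kernel
from $M_{\nu'}$ to $S \times S'$.

Definition 3.4 We refer to the map $\gamma_{\nu'} : M_{\nu'} \to M_{\nu'}$ given by

$$ \gamma_{\nu'}(d\xi_i|\tilde\nu) = \nu'(d\eta_i)\, \frac{e^{-\Phi^{(1)}(\pi_1\tilde\nu, \delta_{\sigma_i})}\, k(\sigma_i, \eta_i)\, \alpha(d\sigma_i)}{\int_S e^{-\Phi^{(1)}(\pi_1\tilde\nu, \delta_{\bar\sigma_i})}\, k(\bar\sigma_i, \eta_i)\, \alpha(d\bar\sigma_i)} $$

as the constrained first layer probability kernel (CFLPK).

Note that for any Borel subset $A$ of $S \times S'$, the map $\gamma_{\nu'}(A|\cdot) : M_{\nu'} \to \mathbb{R}$ is continuous
by the continuity property of $\Phi^{(1)}$. Observe further that not all the measures $\tilde\nu \in M_{\nu'}$
have $\tilde\nu(\cdot|\eta_i) \ll \alpha$, but the fixed points of $\gamma_{\nu'}$ do and include all the minimizers of $J_{\nu'}$.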

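It is not hard to deduce from the second remark below Theorem 3.3 that we can
analogously define a version of the CFLPK for the initial system, namely

$$ \gamma_1(d\sigma_1|\nu) = \frac{e^{-\Phi^{(1)}(\nu, \delta_{\sigma_1})}\, \alpha(d\sigma_1)}{\int_S e^{-\Phi^{(1)}(\nu, \delta_{\bar\sigma_1})}\, \alpha(d\bar\sigma_1)}. \tag{23} $$

Here $\gamma_1$ is a map from $\mathcal{P}(S)$ to itself.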
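
For intuition, the kernel of the initial system can be iterated numerically. The sketch below assumes a two-point spin space, a uniform a priori measure and the illustrative quadratic interaction $\Phi(\nu) = -\frac{\beta}{2}\sum_a \nu(a)^2$, for which $\Phi^{(1)}(\nu, \delta_a) = -\beta\,\nu(a)$; none of these choices or names comes from the text.

```python
import math


def phi_derivative(nu, a, beta):
    """Phi^{(1)}(nu, delta_a) for the assumed Phi(nu) = -(beta/2) * sum_b nu(b)^2."""
    return -beta * nu[a]


def initial_kernel(nu, alpha, beta):
    """gamma_1(.|nu): a priori measure reweighted by exp(-Phi^{(1)}(nu, delta_a))."""
    weights = [alpha[a] * math.exp(-phi_derivative(nu, a, beta))
               for a in range(len(alpha))]
    z = sum(weights)
    return [w / z for w in weights]


def iterate_to_fixed_point(nu, alpha, beta, tol=1e-12, max_iter=10_000):
    """Plain fixed-point iteration nu -> gamma_1(.|nu); stops when successive iterates agree."""
    for _ in range(max_iter):
        new = initial_kernel(nu, alpha, beta)
        if max(abs(x - y) for x, y in zip(new, nu)) < tol:
            return new
        nu = new
    return nu


alpha = [0.5, 0.5]
uniform = iterate_to_fixed_point([0.9, 0.1], alpha, beta=1.0)   # weak coupling
ordered = iterate_to_fixed_point([0.9, 0.1], alpha, beta=3.0)   # strong coupling
```

In this toy model the uniform measure is a fixed point for every $\beta$, while for strong coupling the iteration started from an asymmetric measure settles on a different, asymmetric fixed point, so fixed points of the kernel need not be unique.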

We introduce the following notion of consistency for mean-field models. 

Definition 3.5 A probability measure $\tilde\nu \in M_{\nu'}$ is said to be consistent w.r.t. $\gamma_{\nu'}$ whenever

$$ \gamma_{\nu'}(d\xi_i|\tilde\nu) = \tilde\nu(d\xi_i). \tag{24} $$

The consistent probability measures for the CFLPK are those measures $\tilde\nu \in M_{\nu'}$ for
which $\tilde\nu(\cdot|\eta_i) \ll \alpha$ and whose conditional $\alpha$-density functions $f_{\tilde\nu}$ are given by (20).
Observe that the consistent probability measures are the fixed points of $\gamma_{\nu'}$.
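
A consistent measure can be computed by iterating the constrained kernel. The following sketch assumes finite spaces $S = S' = \{0, 1\}$, uniform a priori measures, a symmetric spin-flip density $k$ and the illustrative interaction $\Phi(\nu) = -\frac{\beta}{2}\sum_a \nu(a)^2$; the kernel formula used is the one suggested by the constrained mean-field equation, and all names are hypothetical.

```python
import math


def first_marginal(joint):
    """pi_1 of a measure on S x S' stored as joint[a][b]."""
    return [sum(row) for row in joint]


def cflpk(joint, nu_prime, alpha, k, beta):
    """Constrained kernel applied to joint, for the assumed quadratic Phi."""
    pi1 = first_marginal(joint)
    out = [[0.0] * len(nu_prime) for _ in alpha]
    for b, mass in enumerate(nu_prime):
        # -Phi^{(1)}(pi_1 joint, delta_a) = beta * pi1[a] for this choice of Phi
        w = [alpha[a] * math.exp(beta * pi1[a]) * k[a][b] for a in range(len(alpha))]
        z = sum(w)
        for a in range(len(alpha)):
            out[a][b] = mass * w[a] / z
    return out


def consistent_measure(nu_prime, alpha, k, beta, steps=500):
    """Iterate the kernel starting from alpha x nu'; converges when the map contracts."""
    joint = [[a * m for m in nu_prime] for a in alpha]
    for _ in range(steps):
        joint = cflpk(joint, nu_prime, alpha, k, beta)
    return joint


p = 0.8                                              # probability of keeping the spin
k = [[2 * p, 2 * (1 - p)], [2 * (1 - p), 2 * p]]     # density w.r.t. uniform alpha'
alpha = [0.5, 0.5]
nu_prime = [0.3, 0.7]                                # constraint on the second layer
fixed = consistent_measure(nu_prime, alpha, k, beta=0.5)
image = cflpk(fixed, nu_prime, alpha, k, beta=0.5)
```

By construction every image of the kernel has second marginal $\nu'$, so the iteration never leaves $M_{\nu'}$; for the small coupling used here the map is a contraction and the limit is the unique consistent measure.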

This notion of consistency also carries over to the kernel $\gamma_1$ for the initial system.
Observe further that the kernel $\gamma_1(\cdot|\nu)$ is continuous as a function of the conditioning
$\nu \in \mathcal{P}(S)$. This continuity property follows from the continuity of
$\Phi^{(1)}(\cdot, \cdot)$ in its first argument. This then implies the Gibbsianness of the initial system
by Definition 2.2.

Proposition 3.6 For any probability measure $\nu' \in \mathcal{P}(S')$, the CFLPK $\gamma_{\nu'}$ has a fixed
point.

Proof: The existence of a fixed point for the CFLPK follows from Tychonov's
fixed point theorem, which states that for any non-empty compact convex subset $X$ of
a locally convex topological vector space $V$ and any continuous function $f : X \to X$, there
is a fixed point of $f$.

Now as we observed in the proof of Proposition 3.2, for any $\nu' \in \mathcal{P}(S')$ the set
$M_{\nu'}$ is compact w.r.t. the weak topology. It is also not hard to see that it is convex.
So we are now only left to show that the space $\mathcal{M}(S \times S')$ of finite signed measures on
$S \times S'$ is a locally convex topological vector space under the weak topology.

That $\mathcal{M}(S \times S')$ is a locally convex topological vector space w.r.t. the weak topology
follows from the following sequence of arguments: The total variation norm turns the
space $\mathcal{M}(S \times S')$ into a Banach space. This then implies that $\mathcal{M}(S \times S')$ is normable
and consequently it is locally convex (i.e. the origin has a local base of convex sets)
w.r.t. the total variation topology by Theorem 1.39 of [29]. It then follows from the
corollary to Theorem 3.4 of [29] that the dual space $\mathcal{M}^*(S \times S')$ of $\mathcal{M}(S \times S')$ separates
points in $\mathcal{M}(S \times S')$. The weak topology on $\mathcal{M}(S \times S')$ generated by the dual space
$\mathcal{M}^*(S \times S')$ then turns $\mathcal{M}(S \times S')$ into a locally convex topological space by Theorem
3.10 of [29].

□ 

The existence of consistent probability measures for the initial kernel $\gamma_1$ is trivial, since
$\gamma_1$ is continuous and $\mathcal{P}(S)$ is a compact convex set. Note also that for each $\nu' \in \mathcal{P}(S')$,
the constrained first layer model is Gibbsian (in the sense of Definition 2.2) by the
continuity property of $\gamma_{\nu'}$. Indeed, the CFLPK $\gamma_{\nu'}$ can be extended to the whole of
$\mathcal{P}(S \times S')$, but the probability measures we will be interested in are those in $M_{\nu'}$.
For each $\nu' \in \mathcal{P}(S')$ denote by $C_{\nu'}$ the set of all consistent probability measures of
$\gamma_{\nu'}$. We state as our next result the following lemma concerning a single-site variational
principle for the CFLM.

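Lemma 3.7 For any probability measure $\tilde\nu \in M_{\nu'}$ the relative entropy $S(\tilde\nu|\alpha \otimes \nu')$
satisfies

$$ S(\tilde\nu|\alpha \otimes \nu') \ge -\Phi^{(1)}(\pi_1\tilde\nu, \pi_1\tilde\nu) + \tilde\nu[\log k(\cdot,\cdot)] - \int_{S'} \nu'(d\eta_i) \log \int_S \alpha(d\sigma_i)\, e^{-\Phi^{(1)}(\pi_1\tilde\nu, \delta_{\sigma_i})}\, k(\sigma_i, \eta_i). \tag{25} $$

In particular equality is attained whenever $\tilde\nu \in C_{\nu'}$.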

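Proof: For any $\bar\nu \in \mathcal{M}_{\nu'}$ the expression on the right-hand side of (25) becomes

$$-\Phi^{(1)}(\pi_1\bar\nu,\pi_1\bar\nu) + \bar\nu[\log k(\cdot,\cdot)] - \int \nu'(d\eta_i)\log\int\alpha(d\sigma_i)\,e^{-\Phi^{(1)}(\pi_1\bar\nu,\delta_{\sigma_i})}k(\sigma_i,\eta_i) = \int_{S\times S'}\bar\nu(d\xi_i)\log\left\{\frac{d\gamma_{\nu'}(\cdot|\bar\nu)}{d(\alpha\otimes\nu')}(\xi_i)\right\}. \tag{26}$$

The proof now reduces to showing that

$$S(\bar\nu|\alpha\otimes\nu') - \int_{S\times S'}\bar\nu(d\xi_i)\log\left\{\frac{d\gamma_{\nu'}(\cdot|\bar\nu)}{d(\alpha\otimes\nu')}(\xi_i)\right\} \ge 0. \tag{27}$$

The case of measures $\bar\nu \in \mathcal{M}_{\nu'}$ with $S(\bar\nu|\alpha\otimes\nu') = \infty$ is trivial, since we get strict
inequality by the boundedness properties of $\Phi$ and $k$.

Now for the case of $\bar\nu \in \mathcal{M}_{\nu'}$ with $S(\bar\nu|\alpha\otimes\nu') < \infty$, we obtain

$$\int_{S\times S'}\bar\nu(d\xi_i)\log\left\{\frac{d\bar\nu}{d(\alpha\otimes\nu')}(\xi_i)\right\} - \int_{S\times S'}\bar\nu(d\xi_i)\log\left\{\frac{d\gamma_{\nu'}(\cdot|\bar\nu)}{d(\alpha\otimes\nu')}(\xi_i)\right\} = \int_{S\times S'}\bar\nu(d\xi_i)\log\left\{\frac{d\bar\nu}{d\gamma_{\nu'}(\cdot|\bar\nu)}(\xi_i)\right\} = S(\bar\nu|\gamma_{\nu'}(\cdot|\bar\nu)) \ge 0, \tag{28}$$

since $S(\bar\nu|\gamma_{\nu'}(\cdot|\bar\nu)) \ge 0$ with equality holding only when $\bar\nu = \gamma_{\nu'}(\cdot|\bar\nu)$. This concludes
the proof.
□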

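We state as our next result a theorem concerning a function $\Psi_{\nu'}$ on $\mathcal{M}_{\nu'}$ which
is dominated by $J_{\nu'}$ and coincides with $J_{\nu'}$ on $\mathcal{C}_{\nu'}$.

Theorem 3.8 For any given $\nu' \in \mathcal{P}(S')$, the function $J_{\nu'}$ satisfies

$$J_{\nu'}(\bar\nu) \ge \Psi_{\nu'}(\bar\nu), \quad\text{where}\quad \Psi_{\nu'}(\bar\nu) = \Phi(\pi_1\bar\nu) - \Phi^{(1)}(\pi_1\bar\nu,\pi_1\bar\nu) - \int\nu'(d\eta_i)\log\int\alpha(d\sigma_i)\,e^{-\Phi^{(1)}(\pi_1\bar\nu,\delta_{\sigma_i})}k(\sigma_i,\eta_i). \tag{29}$$

In particular $J_{\nu'}$ coincides with $\Psi_{\nu'}$ on $\mathcal{C}_{\nu'}$, and if $\Phi$ is homogeneous of degree $p$ then
$\Psi_{\nu'}$ becomes

$$\Psi_{\nu'}(\bar\nu) = (1-p)\,\Phi(\pi_1\bar\nu) - \int\nu'(d\eta_i)\log\int\alpha(d\sigma_i)\,e^{-\Phi^{(1)}(\pi_1\bar\nu,\delta_{\sigma_i})}k(\sigma_i,\eta_i). \tag{30}$$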

Proof: The expression for $\Psi_{\nu'}$ and the inequality (29) follow by substituting the lower
bound on $S(\cdot|\alpha\otimes\nu')$ in Lemma 3.7 into the expression for $J_{\nu'}$ in Proposition 3.2.
Furthermore, if $\Phi$ is homogeneous of degree $p$, we then have

$$\Phi^{(1)}(\pi_1\bar\nu,\pi_1\bar\nu) = \frac{d}{dt}\Big|_{t=0}\Phi(\pi_1\bar\nu + t\,\pi_1\bar\nu) = \frac{d}{dt}\Big|_{t=0}(1+t)^p\,\Phi(\pi_1\bar\nu) = p\,\Phi(\pi_1\bar\nu), \tag{31}$$

and putting this into the expression for $\Psi_{\nu'}$ in (29) yields the desired expression in (30).

□ 

Corollary 3.9 The transformed LDP rate function $J'$ now becomes

$$J'(\nu') = S(\nu'|\alpha') + \Phi_k(\nu') - \mathrm{const}, \quad\text{where}\quad \Phi_k(\nu') = \inf_{\bar\nu\in\mathcal{C}_{\nu'}}\Psi_{\nu'}(\bar\nu). \tag{32}$$

Remark: 1. The above expression for the transformed rate function $J'$ is a consequence
of the fact that $J_{\nu'}$ coincides with $\Psi_{\nu'}$ on $\mathcal{C}_{\nu'}$. Thus $\Phi_k$ is the interaction for the
transformed system arising from the initial system described by $\Phi$ and subjected to the
site-wise transformations governed by $k$.



3.0.1 Examples 

Take $\Phi$ to be an Ising mean-field interaction (i.e. $S = \{+1, -1\}$) given by

$$\Phi(m) = -\frac{\beta m^p}{p}, \tag{33}$$

where $m \in [-1,1]$ and $p > 1$. Here the reason for using $m$ instead of probability
measures on $S$ is that the probability measures on $S$ are uniquely determined by $m$, i.e.
each $m \in [-1,1]$ can uniquely be associated with a probability measure (say $\nu$) on $S$
given by $\nu(\sigma_i) = \frac{1+m}{2}\delta_{+1}(\sigma_i) + \frac{1-m}{2}\delta_{-1}(\sigma_i)$. The expectation w.r.t. $\nu$ then gives rise to
$m$.

We take $k(\sigma_i,\eta_i) = p_t(\sigma_i,\eta_i)$ to be the transition probabilities (i.e. $p_t(\sigma_i,\eta_i)$ is the
probability of starting with $\sigma_i$ at site $i$ and observing $\eta_i$ after $t$ time units) for rate one
site-wise independent spin-flip dynamics on $S$ [25]. Here both $S$ and $S'$ are the same,
and the a priori measures are $\alpha = \alpha' = \frac{1}{2}(\delta_{+1} + \delta_{-1})$. More precisely, $p_t(\sigma_i,\eta_i)$ is given by

$$p_t(\sigma_i,\eta_i) = \frac{e^{\sigma_i\eta_i h_t}}{2\cosh(h_t)}, \quad\text{where}\quad h_t = \frac{1}{2}\log\frac{1+e^{-2t}}{1-e^{-2t}}. \tag{34}$$



As pointed out above, we will denote by $r \in [-1,1]$ the expected values of the probability
measures on the transformed single-site space. We will also write $m'$ for the expected
values of the first marginals of probability measures on $S \times S'$ with fixed second marginal.
The fixed second marginal will be assumed to have mean $r$.
Then $\Psi_{\cdot}$ for this set-up becomes

$$\Psi_r(m') = \frac{\beta(p-1)}{p}\,m'^{\,p} - \frac{1+r}{2}\log\left\{\cosh\left(\beta m'^{\,p-1} + h_t\right)\right\} - \frac{1-r}{2}\log\left\{\cosh\left(\beta m'^{\,p-1} - h_t\right)\right\} + \log\left(2\cosh(h_t)\right). \tag{35}$$

Consequently, this form of $\Psi_r$ gives rise to the mean-field equation

$$m' = \frac{1+r}{2}\tanh\left(\beta m'^{\,p-1} + h_t\right) + \frac{1-r}{2}\tanh\left(\beta m'^{\,p-1} - h_t\right). \tag{36}$$



Remark: In the case $p = 2$, $\Psi_{\cdot}$ is the Hubbard-Stratonovich potential function.
The unicity of global minimizers of this potential function played a crucial role in
determining the Gibbs and non-Gibbs properties of the corresponding transformed system
studied in [25].

In [25] the derivation of $\Psi_{\cdot}$ is based on the quadratic nature of the interaction $\Phi$.
The technique employed there cannot be used to derive $\Psi_{\cdot}$ for non-quadratic interactions,
and this is where our approach comes to the rescue, i.e. our approach of deriving $\Psi_{\cdot}$ via
the machinery of large deviations is adaptable to more general mean-field interactions
(both discrete and continuous spins).
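As a numerical illustration of the mean-field equation (36), the following minimal Python sketch evaluates the potential (35) up to its additive constant and solves (36) by fixed-point iteration. The function names, the restriction to integer $p \ge 2$, the starting point and the parameter values are our own assumptions made for illustration; plain iteration is only expected to converge when the right-hand side of (36) is a contraction in $m'$ (e.g. $p = 2$ and $\beta < 1$), and it does not by itself decide uniqueness of the global minimizer.

```python
import math

def h_t(t):
    """Effective field h_t = 0.5*log((1+e^{-2t})/(1-e^{-2t})) from (34); needs t > 0."""
    e = math.exp(-2.0 * t)
    return 0.5 * math.log((1.0 + e) / (1.0 - e))

def psi(m, r, beta, p, t):
    """Potential (35) as a function of m', dropping the m'-independent constant.
    Assumes an integer exponent p >= 2 so that m**(p-1) is defined for m < 0."""
    h = h_t(t)
    x = beta * m ** (p - 1)
    return (beta * (p - 1) / p * m ** p
            - 0.5 * (1 + r) * math.log(math.cosh(x + h))
            - 0.5 * (1 - r) * math.log(math.cosh(x - h)))

def mean_field_rhs(m, r, beta, p, t):
    """Right-hand side of the mean-field equation (36)."""
    h = h_t(t)
    x = beta * m ** (p - 1)
    return 0.5 * (1 + r) * math.tanh(x + h) + 0.5 * (1 - r) * math.tanh(x - h)

def solve_mean_field(r, beta, p, t, m0=0.5, tol=1e-12, max_iter=10_000):
    """Fixed-point iteration m <- rhs(m); hypothetical helper, not from the paper."""
    m = m0
    for _ in range(max_iter):
        m_new = mean_field_rhs(m, r, beta, p, t)
        if abs(m_new - m) < tol:
            return m_new
        m = m_new
    raise RuntimeError("fixed-point iteration did not converge")

if __name__ == "__main__":
    m_star = solve_mean_field(r=0.3, beta=0.5, p=2, t=1.0)
    print("solution of (36):", m_star)
```

Any solution of (36) is a stationary point of (35), so a finite-difference derivative of `psi` at the returned value should vanish up to discretisation error.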

We now discuss in detail Gibbsianness and non-Gibbsianness for mean-field models as
introduced in Definition 2.2.



3.1 Gibbsianness for Transforms of mean-field models 

In this subsection we study the Gibbs properties of the transformed measures $\mu'_N$
introduced in Section 2.2. This investigation of the Gibbs properties of transformed
measures shall be based on the continuity properties of the conditional distributions $\gamma'_n$
as a function of the conditioning. Before we formulate this, let us fix some notation
that we shall use in our formulation. For each $N \ge 2$ and $1 \le n < N$ we denote by
$\bar\mu_{N-n}[\hat\eta_{V_N\setminus V_n}]$ the joint system in $V_N \setminus V_n$ when the second-layer spins are constrained
to a given configuration $\hat\eta \in \Omega'$, i.e. $\hat\eta_{V_N\setminus V_n}$ is the projection of $\hat\eta$ onto $V_N \setminus V_n$. As we
pointed out above, a representative $\hat\eta_{V_N\setminus V_n}$ of a class of configurations in $S'^{\,N-n}$
with the same empirical measure $L_{N-n}(\hat\eta_{V_N\setminus V_n})$ will give rise to the same measure
$\bar\mu_{N-n}[\hat\eta_{V_N\setminus V_n}]$. Therefore fixing $\hat\eta_{V_N\setminus V_n}$ implies that we are restricting attention to only
the configurations in a subset of $S'^{\,N-n}$ with fixed $L_{N-n}(\hat\eta_{V_N\setminus V_n})$. Suppose $\hat\eta_{V_N\setminus V_n}$
is one such representative; we call $\bar\mu_{N-n}[\hat\eta_{V_N\setminus V_n}]$ the restricted constrained first layer
model (RCFLM) for the corresponding mean-field model. It is restricted because we
are not taking into account the spins in $V_n$, and constrained because we have frozen the
configurations in the second layer to $\hat\eta_{V_N\setminus V_n}$. More precisely,



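$$\bar\mu_{N-n}[\hat\eta_{V_N\setminus V_n}](d\sigma_{V_N\setminus V_n}) = \frac{\exp\left(-N\Phi(\pi_1\bar\nu_{N,n})\right)\prod_{i=n+1}^{N}k(\sigma_i,\hat\eta_i)\,\alpha(d\sigma_i)}{\int\exp\left(-N\left\{\Phi(\pi_1\bar\nu_{N,n}) - \bar\nu_{N,n}[\log k]\right\}\right)\prod_{i=n+1}^{N}\alpha(d\sigma_i)}, \tag{37}$$

where $\bar\nu_{N,n} = \frac{N-n}{N}L_{N-n}(\xi_{V_N\setminus V_n})$, $L_{N-n}(\xi_{V_N\setminus V_n}) = \frac{1}{N-n}\sum_{i=n+1}^{N}\delta_{\xi_i}$ and $\xi_i = (\sigma_i,\hat\eta_i)$.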

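Remark: Suppose $\nu' \in \mathcal{P}(S')$ is the empirical measure for the configuration $\hat\eta$. Then
for a fixed $n$, the sequence of measures $\bar\nu_{N,n}$ under the push-forwards of $\bar\mu_{N-n}[\hat\eta_{V_N\setminus V_n}]$
satisfies an LDP with rate $N$ and rate function $\tilde J_{\nu'} : \mathcal{M}_{\nu'} \to \mathbb{R}\cup\{+\infty\}$ given by

$$\tilde J_{\nu'}(\bar\nu) = J_{\nu'}(\bar\nu) - \mathrm{const}_{\nu'}, \quad\text{where}\quad \mathrm{const}_{\nu'} = \inf_{\bar\nu\in\mathcal{M}_{\nu'}}J_{\nu'}(\bar\nu). \tag{38}$$

The validity of the expression for $\tilde J_{\nu'}$ lies in the fact that for $\hat\eta \in \Omega'$ constrained to
have empirical measure $\nu'$, the sequence of empirical measures $L_{N-n}(\xi_{V_N\setminus V_n})$ converges
weakly in the $N$-limit to an element in $\mathcal{M}_{\nu'}$. For any such measure $\bar\nu$ for which
$\bar\nu(\cdot|\eta_i) \ll \alpha$ (for $\nu'$-almost all $\eta_i \in S'$),

$$\bar\nu(d\sigma_i,d\eta_i) = \nu'(d\eta_i)\,f_{\nu'}(\sigma_i|\eta_i)\,\alpha(d\sigma_i). \tag{39}$$

Therefore, the relative entropy of such probability measures $\bar\nu$ w.r.t. $\alpha\otimes\nu'$ then becomes

$$S(\bar\nu|\alpha\otimes\nu') = \int\nu'(d\eta_i)\int f_{\nu'}(\sigma_i|\eta_i)\log f_{\nu'}(\sigma_i|\eta_i)\,\alpha(d\sigma_i). \tag{40}$$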

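Our next result in this subsection concerns a representation of the finite-volume
transformed conditional distributions $\mu'_{n,N}(\cdot|\hat\eta_{V_N\setminus V_n})$ in terms of the RCFLM $\bar\mu_{N-n}[\hat\eta_{V_N\setminus V_n}]$.

Lemma 3.10 Let $N$, $n$ and $\hat\eta_{V_N\setminus V_n}$ be as above. Then the finite-volume conditional
distribution $\mu'_{n,N}(\cdot|\hat\eta_{V_N\setminus V_n})$ for the transformed system has the form

$$\mu'_{n,N}(d\eta_{V_n}|\hat\eta_{V_N\setminus V_n}) = \frac{\bar\mu_{N-n}[\hat\eta_{V_N\setminus V_n}]\left[\prod_{i=1}^{n}\int e^{-\Phi^{(1)}(\pi_1\bar\nu_{N,n},\delta_{\sigma_i}) + o(1)}\,k(\sigma_i,\eta_i)\,\alpha(d\sigma_i)\,\alpha'(d\eta_i)\right]}{\bar\mu_{N-n}[\hat\eta_{V_N\setminus V_n}]\left[\prod_{i=1}^{n}\int e^{-\Phi^{(1)}(\pi_1\bar\nu_{N,n},\delta_{\sigma_i}) + o(1)}\,\alpha(d\sigma_i)\right]}. \tag{41}$$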



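Proof: Note from the definition of the transformed system that we can write
$\mu'_{n,N}(d\eta_{V_n}|\hat\eta_{V_N\setminus V_n})$ as

$$\frac{\int e^{-N\Phi(\pi_1 L_N(\xi_{V_N}))}\prod_{i=1}^{N}k(\sigma_i,\eta_i)\,\alpha(d\sigma_i)\prod_{i=1}^{n}\alpha'(d\eta_i)}{\int e^{-N\Phi(\pi_1 L_N(\tilde\xi_{V_N}))}\prod_{i=1}^{N}k(\tilde\sigma_i,\tilde\eta_i)\,\alpha(d\tilde\sigma_i)\prod_{i=1}^{n}\alpha'(d\tilde\eta_i)}, \tag{42}$$

where the integrals in the numerator are over the $\sigma$-variables only, the joint configuration $\xi_{V_N}$ is such that $\xi_{V_N\setminus V_n} = (\xi_i)_{i\in V_N\setminus V_n}$ and $\xi_{V_n} =
(\sigma_i,\eta_i)_{i\in V_n}$, and $\xi_i = (\sigma_i,\hat\eta_i)$ for $i \in V_N\setminus V_n$. Now by writing the joint empirical measure as

$$L_N(\xi_{V_N}) = \frac{N-n}{N}L_{N-n}(\xi_{V_N\setminus V_n}) + \frac{n}{N}L_n(\xi_{V_n})$$

and adding and subtracting $N\Phi\left(\pi_1\frac{N-n}{N}L_{N-n}(\xi_{V_N\setminus V_n})\right)$ from the exponent $N\Phi(\pi_1 L_N(\xi_{V_N}))$
we obtain

$$\Phi(\pi_1 L_N(\xi_{V_N})) = \Phi(\pi_1\bar\nu_{N,n}) + \frac{1}{N}\sum_{i=1}^{n}\Phi^{(1)}\left(\pi_1\frac{N-n}{N}L_{N-n}(\xi_{V_N\setminus V_n}),\delta_{\sigma_i}\right) + o\left(\frac{1}{N}\right), \tag{43}$$

where $o\left(\frac{1}{N}\right)$ is a consequence of the differentiability condition in Definition 2.1, where we have taken $t = \frac{n}{N}$. Finally,
putting this expression for $\Phi(\pi_1 L_N(\xi_{V_N}))$ into the expression for
$\mu'_{n,N}(d\eta_{V_n}|\hat\eta_{V_N\setminus V_n})$ in (42) and multiplying the resulting expression by

$$\frac{\int\prod_{j=n+1}^{N}\alpha(d\sigma_j)\,k(\sigma_j,\hat\eta_j)\exp\left(-N\Phi(\pi_1\bar\nu_{N,n})\right)}{\int\prod_{j=n+1}^{N}\alpha(d\sigma_j)\,k(\sigma_j,\hat\eta_j)\exp\left(-N\Phi(\pi_1\bar\nu_{N,n})\right)} = 1 \tag{44}$$

concludes the proof of the lemma.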

□ 

We now state the infinite-volume ($N \to \infty$) version of Lemma 3.10. A sufficient
condition for the existence of the finite-volume conditional distributions with
infinite-volume $\eta$-conditioning is provided. This sufficient condition is the unicity of the global
minimizers of the function $J_{\nu'}$, $\nu' \in \mathcal{P}(S')$.

Theorem 3.11 For each $\nu' \in \mathcal{P}(S')$, let $J_{\nu'} : \mathcal{M}_{\nu'} \to \mathbb{R}\cup\{+\infty\}$ be as defined in Proposition 3.2.
Suppose further that for a given $\nu' \in \mathcal{P}(S')$, $J_{\nu'}$ has a unique global minimizer $\bar\nu^* \in \mathcal{M}_{\nu'}$.
I) Then

$$\gamma'_n(d\eta_{V_n}|\nu') = \lim_{N\uparrow\infty}\mu'_{n,N}(d\eta_{V_n}|\hat\eta_{V_N\setminus V_n}) = \prod_{i=1}^{n}\gamma'_1(d\eta_i|\nu'), \quad\text{where}\quad \gamma'_1(d\eta_i|\nu') = \frac{\int_S\alpha(d\sigma_i)\,e^{-\Phi^{(1)}(\pi_1\bar\nu^*,\delta_{\sigma_i})}k(\sigma_i,\eta_i)\,\alpha'(d\eta_i)}{\int_S\alpha(d\sigma_i)\,e^{-\Phi^{(1)}(\pi_1\bar\nu^*,\delta_{\sigma_i})}}. \tag{45}$$

II) If $J_{\lambda'}$ has a unique global minimizer $\bar\lambda^*$ for all $\lambda'$ in a weak neighborhood of $\nu'$, then
$\gamma'_n(d\eta_{V_n}|\nu')$ is weakly continuous at $\nu'$ as a function of the conditioning $\nu' \in \mathcal{P}(S')$.

Remark: 1. The $\nu'$ dependence of the expression for $\gamma'_1(d\eta_i|\nu')$ is hidden in $\Phi^{(1)}$ via
the probability measure $\pi_1\bar\nu^*$.

2. Theorem 3.11 provides a sufficient condition for the transformed system to be
Gibbs (in the sense of Definition 2.2), namely the unicity of global minimizers of $J_{\cdot}$.
Thus the problem of determining whether the transformed system is Gibbs or not is
translated into the corresponding problem of studying the global minimizers of $J_{\cdot}$.

Proof of Theorem 3.11:

I) The proof follows from the form of the finite-volume conditional distribution
given in Lemma 3.10 and the hypothesis that the function $J_{\nu'}$ has a unique global
minimizer, because the leading term in the large-$N$ asymptotics of $\mu'_{n,N}(d\eta_{V_n}|\hat\eta_{V_N\setminus V_n})$ is
governed by the global minimizers of $J_{\nu'}$.

II) By hypothesis we get continuity for free by the continuity properties of the mean-field
interaction $\Phi$.

□ 



4 Gibbsianness of transformed systems and the contraction map theorem

This section is devoted to studying the minimizers of the function $J_{\cdot}$ for a special
class of initial interactions $\Phi$. Up to this point all topological considerations have been
w.r.t. the weak topology, i.e. the weak topology is sufficient to study Gibbs measures
and Gibbs properties of transforms of Gibbs measures for mean-field models. We now
consider another topology on the spaces of measures which is stronger than the weak
topology. This topology is the one induced by the total variational metric. Continuity
in this new topology implies continuity w.r.t. the weak topology. All topological
considerations for the interactions we consider in this section shall be w.r.t. the
variational topology. Additionally, we also impose further smoothness requirements on the
initial interactions $\Phi$ beyond those given in Definition 2.1. All these restrictions on
the interactions are required to derive explicit continuity estimates on the CFLPKs and
consequently on the transformed kernels.

To be precise, we consider interactions $\Phi$ that are given by

$$\Phi(\nu) = F(\nu[g_1],\ldots,\nu[g_l]), \tag{46}$$

where the $g_i$ are some fixed bounded non-constant real-valued measurable functions
defined on $S$, $l \ge 1$ and $F : \mathbb{R}^l \to \mathbb{R}$ is some twice continuously differentiable
function (e.g. a polynomial). In the following we will write $g = (g_1,\ldots,g_l)$ and
$\nu[g] = (\nu[g_1],\ldots,\nu[g_l])$. By setting $m_j = m_j(\nu) = \nu[g_j]$, we have for this choice of
interaction that

$$\Phi^{(1)}(\nu,\delta_{\sigma_i}) = \sum_{j=1}^{l}F_j(\nu[g_1],\ldots,\nu[g_l])\,g_j(\sigma_i), \tag{47}$$

where $F_j(m) = \frac{\partial}{\partial m_j}F(m)$ and $m = m(\nu) = \nu[g]$. We also set $F_{ju}(m) = \frac{\partial^2}{\partial m_j\partial m_u}F(m)$.
Additionally, we assume that $g$ is a Lipschitz function from $S$ to $\mathbb{R}^l$, with Lipschitz norm

$$\|g\|_{d,2} = \sup_{\sigma_i\ne\bar\sigma_i}\frac{\|g(\sigma_i) - g(\bar\sigma_i)\|_2}{d(\sigma_i,\bar\sigma_i)}, \tag{48}$$

where $d$ is the metric on $S$. We also denote by $\delta(g)$ the sum of the oscillations of the
components of $g$, i.e.

$$\delta(g) = \sum_{j=1}^{l}\delta(g_j). \tag{49}$$

For any $g$ satisfying the above conditions we set

$$D_g = \{\nu[g] : \nu \in \mathcal{P}(S)\}. \tag{50}$$

Note that $D_g$ is a compact subset of $\mathbb{R}^l$ by the boundedness of $g$. In the sequel we will
write $\|\partial^2F\|_{\max,\infty}$ for the supremum of the matrix max norm of the Hessian $\partial^2F$, i.e.

$$\|\partial^2F\|_{\max,\infty} = \sup_{m\in D_g}\|\partial^2F(m)\|_{\max}, \quad\text{where}\quad \|\partial^2F(m)\|_{\max} = \max_{1\le j,u\le l}|F_{ju}(m)|. \tag{51}$$

Furthermore, we also set

$$\delta(\Phi^{(1)}) = \sup_{m\in D_g}\ \sup_{\sigma_i,\bar\sigma_i\in S}\left|\sum_{j=1}^{l}F_j(m)\left(g_j(\sigma_i) - g_j(\bar\sigma_i)\right)\right|. \tag{52}$$

At this point one may wonder whether the class of interactions we are considering
in this section has any physical relevance. Indeed, it contains important mean-field
interactions like the Curie-Weiss interactions, liquid crystal interactions, sums of
"p-spin" interactions etc., that have featured prominently in the literature.

4.1 Lipschitz Continuity of the CFLPK and Gibbsianness of Transformed System



We have already seen from the remark below equation (22) that the CFLPK $\gamma_{\cdot}$ is
weakly continuous. In this subsection we show, however, that the CFLPK is Lipschitz
continuous w.r.t. the variational metric (defined below). We write

$$\|\nu - \tilde\nu\| = \frac{1}{2}\sup_{|\varphi|\le 1}|\nu(\varphi) - \tilde\nu(\varphi)| = \sup_{\psi}\frac{|\nu(\psi) - \tilde\nu(\psi)|}{\delta(\psi)}, \quad\text{where}\quad \delta(\psi) = \sup_{\sigma_i,\bar\sigma_i}|\psi(\sigma_i) - \psi(\bar\sigma_i)|, \tag{53}$$

for the variational distance between the probability measures $\nu$ and $\tilde\nu$, where the suprema
are respectively taken over all measurable real-valued functions $\varphi$ with $|\varphi| \le 1$
and over all bounded non-constant measurable real-valued functions $\psi$ on $S$. The variational
distance can also be defined by the following consideration: The signed measure $\nu - \tilde\nu$ has
respectively $(\nu - \tilde\nu)^+$ and $(\nu - \tilde\nu)^-$ as the positive and negative parts of its Jordan
decomposition. But the fact that $(\nu - \tilde\nu)(S) = 0$ implies that $(\nu - \tilde\nu)^+(S) = (\nu - \tilde\nu)^-(S)$,
leading to the definition of the variational distance between $\nu$ and $\tilde\nu$ as one-half of
the total variation of $(\nu - \tilde\nu)$, i.e.

$$\|\nu - \tilde\nu\| = (\nu - \tilde\nu)^+(S) = (\nu - \tilde\nu)^-(S). \tag{54}$$
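To make the two descriptions of the variational distance concrete, here is a small Python sketch on a finite state space. It compares the Jordan-decomposition formula, as in (54), with one-half of the supremum of $|\nu(\varphi) - \tilde\nu(\varphi)|$ over test functions bounded by one. The dictionary representation of measures and the function names are illustrative assumptions; the brute-force search over sign functions suffices because a linear functional on the cube $\{|\varphi| \le 1\}$ attains its maximum at a vertex.

```python
import itertools

def tv_jordan(nu, mu):
    """Mass of the positive part (nu - mu)^+ of the Jordan decomposition."""
    return sum(max(nu[x] - mu[x], 0.0) for x in nu)

def tv_sup(nu, mu):
    """One-half of sup over |phi| <= 1 of |nu(phi) - mu(phi)|.
    The supremum is attained at a sign function, so we enumerate all of them."""
    keys = list(nu)
    best = 0.0
    for signs in itertools.product((-1.0, 1.0), repeat=len(keys)):
        value = abs(sum(s * (nu[x] - mu[x]) for s, x in zip(signs, keys)))
        best = max(best, value)
    return 0.5 * best

if __name__ == "__main__":
    nu = {"a": 0.5, "b": 0.3, "c": 0.2}
    mu = {"a": 0.2, "b": 0.3, "c": 0.5}
    print(tv_jordan(nu, mu), tv_sup(nu, mu))
```

Both functions should agree for any pair of probability vectors on the same finite set; the enumeration is exponential in the number of states and is meant only as a check on small examples.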

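Before we state our first result in this section let us fix further notation. We set

$$C(F,g) = 2\,\|\partial^2F\|_{\max,\infty}\,\delta(g)\,\|g\|_{d,2}\exp\left(\frac{\delta(\Phi^{(1)})}{2}\right) \quad\text{and}\quad \rho_\alpha(k) = \sup_{\eta_i\in S'}\ \inf_{\bar\sigma_i\in S}\left(\int_S d^2(\sigma_i,\bar\sigma_i)\,k(\sigma_i,\eta_i)\,\alpha(d\sigma_i)\right)^{\frac{1}{2}}. \tag{55}$$

Theorem 4.1 For any $\nu' \in \mathcal{P}(S')$ and each pair $\bar\nu_1,\bar\nu_2 \in \mathcal{M}_{\nu'}$, the CFLPK satisfies

$$\|\gamma_{\nu'}(\cdot|\bar\nu_1) - \gamma_{\nu'}(\cdot|\bar\nu_2)\| \le L\,\|\bar\nu_1 - \bar\nu_2\|, \quad\text{where}\quad L = L(F,g,k) = C(F,g)\,\rho_\alpha(k). \tag{56}$$

The above theorem says that for each $\nu' \in \mathcal{P}(S')$, the CFLPK $\gamma_{\nu'}$ is Lipschitz continuous
on $\mathcal{M}_{\nu'}$ with Lipschitz constant $L$.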

Remark: 1. The quantity $\rho_\alpha(k)$ is the (metric-space version of the) standard deviation
of the single-site "posterior distribution" $K(d\sigma_i|\eta_i)$, when we take the supremum over the
possible observations $\eta_i$. So it describes the worst-$\eta$ scenario for the typical size of
fluctuations in the initial configurations which have led to $\eta$. The constant $L$ factorizes
into two constants, reflecting the idea of "nature $C(F,g)$ versus nurture $\rho_\alpha(k)$".

2. Set

$$\rho_\alpha = \inf_{\bar\sigma_i\in S}\left(\int\alpha(d\sigma_i)\,d^2(\sigma_i,\bar\sigma_i)\right)^{\frac{1}{2}}, \tag{57}$$

i.e. $\rho_\alpha$ is the metric space version of the standard deviation of $\alpha$. Then the initial kernel
$\gamma_1$ for the interactions considered in this section is also Lipschitz continuous, i.e. for
any pair $\nu_1,\nu_2 \in \mathcal{P}(S)$ we have

$$\|\gamma_1(\cdot|\nu_1) - \gamma_1(\cdot|\nu_2)\| \le \bar L\,\|\nu_1 - \nu_2\|, \quad\text{where}\quad \bar L = C(F,g)\,\rho_\alpha. \tag{58}$$

$\bar L$ is the "Dobrushin's constant" for the initial mean-field model.



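Proof of Theorem 4.1: Take a measurable map $f : S \times S' \to \mathbb{R}$ with $|f| \le 1$. Also
for any pair $\bar\nu_1,\bar\nu_2 \in \mathcal{M}_{\nu'}$ and any $0 \le s \le 1$ we define $\bar\nu_s := s\bar\nu_1 + (1-s)\bar\nu_2$. Then we
have

$$\gamma_{\nu'}(f|\bar\nu_1) - \gamma_{\nu'}(f|\bar\nu_2) = \int\nu'(d\eta_i)\int\alpha(d\sigma_i)\,f(\sigma_i,\eta_i)\int_0^1 ds\,\frac{d}{ds}h_{\bar\nu_s}(\sigma_i,\eta_i), \quad\text{where}\quad h_{\bar\nu_s}(\sigma_i,\eta_i) = \frac{e^{-\Phi^{(1)}(\pi_1\bar\nu_s,\delta_{\sigma_i})}k(\sigma_i,\eta_i)}{\int e^{-\Phi^{(1)}(\pi_1\bar\nu_s,\delta_{\tilde\sigma_i})}k(\tilde\sigma_i,\eta_i)\,\alpha(d\tilde\sigma_i)}. \tag{59}$$

We also set $\lambda_s[\eta_i](d\sigma_i) = h_{\bar\nu_s}(\sigma_i,\eta_i)\,\alpha(d\sigma_i)$. Now using the form of the interactions
considered in this section, it is not hard to deduce that

$$\frac{d}{ds}\Phi^{(1)}(\pi_1\bar\nu_s,\delta_{\sigma_i}) = \sum_{j=1}^{l}\sum_{u=1}^{l}F_{ju}(m(\pi_1\bar\nu_s))\,g_j(\sigma_i)\int\pi_1(\bar\nu_1 - \bar\nu_2)(d\tilde\sigma_i)\,g_u(\tilde\sigma_i). \tag{60}$$

This and further computations yield the following expression for $\frac{d}{ds}h_{\bar\nu_s}$:

$$\frac{d}{ds}h_{\bar\nu_s}(\sigma_i,\eta_i) = -\sum_{j=1}^{l}\sum_{u=1}^{l}F_{ju}(m(\pi_1\bar\nu_s))\,\psi_u\,h_{\bar\nu_s}(\sigma_i,\eta_i)\left(g_j(\sigma_i) - \lambda_s[\eta_i](g_j)\right), \quad\text{where}\quad \psi_u = \int\pi_1(\bar\nu_1 - \bar\nu_2)(d\sigma_i)\,g_u(\sigma_i). \tag{61}$$

Observe from (53) that

$$\left|\int\nu'(d\eta_i)\int\left(\bar\nu_1(d\sigma_i|\eta_i) - \bar\nu_2(d\sigma_i|\eta_i)\right)g_u(\sigma_i)\right| \le \delta(g_u)\,\|\bar\nu_1 - \bar\nu_2\|. \tag{62}$$

Putting all these together we arrive at

$$\left|\gamma_{\nu'}(f|\bar\nu_1) - \gamma_{\nu'}(f|\bar\nu_2)\right| \le \|\bar\nu_1 - \bar\nu_2\|\sum_{j=1}^{l}\sum_{u=1}^{l}\delta(g_u)\int_0^1 ds\,|F_{ju}(m(\pi_1\bar\nu_s))|\int\nu'(d\eta_i)\,\varphi(s,\eta_i), \quad\text{where}\quad \varphi(s,\eta_i) = \int\alpha(d\sigma_i)\,h_{\bar\nu_s}(\sigma_i,\eta_i)\left|g_j(\sigma_i) - \lambda_s[\eta_i](g_j)\right|. \tag{63}$$

By adding and subtracting $g_j(\bar\sigma_i)$ (for an arbitrary $\bar\sigma_i \in S$) from the term $g_j(\sigma_i) -
\lambda_s[\eta_i](g_j)$ in the definition of $\varphi(s,\eta_i)$ and applying the triangle inequality we arrive at
the following:

$$\varphi(s,\eta_i) \le 2\int\alpha(d\sigma_i)\,h_{\bar\nu_s}(\sigma_i,\eta_i)\,|g_j(\sigma_i) - g_j(\bar\sigma_i)|. \tag{64}$$

Further, it follows from Hölder's inequality that

$$\int\alpha(d\sigma_i)\,h_{\bar\nu_s}(\sigma_i,\eta_i)\,|g_j(\sigma_i) - g_j(\bar\sigma_i)| \le \left(\int\alpha(d\sigma_i)\,h_{\bar\nu_s}(\sigma_i,\eta_i)\left(g_j(\sigma_i) - g_j(\bar\sigma_i)\right)^2\right)^{\frac{1}{2}}. \tag{65}$$

Now by replacing $|F_{ju}(m(\pi_1\bar\nu_s))|$ with $\|\partial^2F\|_{\max,\infty}$ in (63) and using the fact that the
square root function is concave we obtain

$$\left|\gamma_{\nu'}(f|\bar\nu_1) - \gamma_{\nu'}(f|\bar\nu_2)\right| \le 2\,\|\bar\nu_1 - \bar\nu_2\|\,\delta(g)\,\|\partial^2F\|_{\max,\infty}\int_0^1 ds\int\nu'(d\eta_i)\,\chi(s,\eta_i,\bar\sigma_i), \quad\text{where}\quad \chi(s,\eta_i,\bar\sigma_i) = \left(\int\alpha(d\sigma_i)\,h_{\bar\nu_s}(\sigma_i,\eta_i)\sum_{j=1}^{l}\left(g_j(\sigma_i) - g_j(\bar\sigma_i)\right)^2\right)^{\frac{1}{2}}. \tag{66}$$

Note further that the $\bar\sigma_i$ appearing in $\chi$ is chosen independently of all the parameters in
the model, so taking the infimum over $\bar\sigma_i$ will have no influence on our estimates. In
view of this observation, replacing $\chi(s,\eta_i,\bar\sigma_i)$ with $\inf_{\bar\sigma_i\in S}\chi(s,\eta_i,\bar\sigma_i)$ will have no effect
on the inequality in (66). Furthermore, it follows from the Lipschitz property of $g$ and
the fact that $h_{\bar\nu_s}(\sigma_i,\eta_i) \le e^{\delta(\Phi^{(1)})}k(\sigma_i,\eta_i)$ that

$$\inf_{\bar\sigma_i\in S}\chi(s,\eta_i,\bar\sigma_i) \le \|g\|_{d,2}\,e^{\frac{\delta(\Phi^{(1)})}{2}}\sup_{\eta_i\in S'}\ \inf_{\bar\sigma_i\in S}\left(\int\alpha(d\sigma_i)\,k(\sigma_i,\eta_i)\,d^2(\sigma_i,\bar\sigma_i)\right)^{\frac{1}{2}}. \tag{67}$$

Putting the bound in (67) into the bound in (66) yields the desired result.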

□ 

Observe from Theorem 4.1 that if the constant $L < 1$ then $\gamma_{\nu'}$ defines a contraction
map from $\mathcal{M}_{\nu'}$ to itself. This is because the variational distance turns the set
$\mathcal{M}_{\nu'} = \pi_2^{-1}(\{\nu'\})$ into a complete metric space, by continuity of the map $\pi_2$ under the
variational topology. Thus the CFLPK admits a unique consistent probability measure,
and consequently $J_{\cdot}$ has a unique global minimizer, since the minimizers
of $J_{\cdot}$ are contained in the set of consistent probability measures for the CFLPK. This
then implies that the transformed system is Gibbs.
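The contraction argument suggests computing the consistent probability measure by simply iterating the kernel. The Python sketch below does this for an illustrative special case that we choose ourselves: Ising spins $S = S' = \{-1,+1\}$, the quadratic interaction $F(m) = -\beta m^2/2$ with $g(\sigma_i) = \sigma_i$ (so that $\Phi^{(1)}(\nu,\delta_{\sigma_i}) = -\beta m\,\sigma_i$ by (47)), and the spin-flip kernel (34). All function names and parameter values are assumptions. The sketch does not verify the sufficient condition $L < 1$; convergence of the iteration is simply observed for the chosen parameters.

```python
import math

SPINS = (-1, 1)

def spin_flip_kernel(t):
    """Return k(s, e) = exp(s*e*h_t) / (2*cosh(h_t)) with h_t as in (34)."""
    e2 = math.exp(-2.0 * t)
    h = 0.5 * math.log((1.0 + e2) / (1.0 - e2))
    return lambda s, e: math.exp(s * e * h) / (2.0 * math.cosh(h))

def cflpk_step(nu_bar, nu_prime, beta, k, alpha):
    """One application of nu_bar -> gamma_{nu'}(.|nu_bar) for F(m) = -beta*m^2/2."""
    m = sum(s * nu_bar[(s, e)] for s in SPINS for e in SPINS)  # mean of first marginal
    out = {}
    for e in SPINS:
        # unnormalised weights alpha(s) * exp(-Phi^(1)) * k(s, e), with -Phi^(1) = beta*m*s
        w = {s: alpha[s] * math.exp(beta * m * s) * k(s, e) for s in SPINS}
        z = sum(w.values())
        for s in SPINS:
            out[(s, e)] = nu_prime[e] * w[s] / z
    return out

def consistent_measure(nu_prime, beta, k, alpha, tol=1e-13, max_iter=100_000):
    """Iterate the kernel from the product measure until the variational
    distance between successive iterates drops below tol."""
    nu_bar = {(s, e): alpha[s] * nu_prime[e] for s in SPINS for e in SPINS}
    for _ in range(max_iter):
        new = cflpk_step(nu_bar, nu_prime, beta, k, alpha)
        if 0.5 * sum(abs(new[x] - nu_bar[x]) for x in new) < tol:
            return new
        nu_bar = new
    raise RuntimeError("iteration did not converge")

if __name__ == "__main__":
    alpha = {-1: 0.5, 1: 0.5}
    nu_prime = {1: 0.65, -1: 0.35}          # second marginal with mean r = 0.3
    fixed = consistent_measure(nu_prime, 0.5, spin_flip_kernel(1.0), alpha)
    print(fixed)
```

In this special case the mean of the first marginal of the fixed point solves the mean-field equation (36) with $p = 2$, which gives an independent check of the iteration.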

The next item on our list of tasks is the investigation of how (in the regime $L < 1$)
the unique consistent probability measure $\bar\nu^*$ for $\gamma_{\nu'}$ behaves w.r.t. $\nu' \in \mathcal{P}(S')$. Indeed
we show in the proposition below that $\bar\nu^*$ depends continuously on $\nu'$.
Proposition 4.2 Suppose the constant $L < 1$; then under the variational metric the
unique consistent probability measure $\bar\nu^*$ for $\gamma_{\nu'}$ is Lipschitz continuous w.r.t. $\nu'$ and
has the Lipschitz norm $L_1 = 4L$.

Remark: The constant $L$ here is comparable with the uniform bound on the Dobrushin
constant $c'[\eta]$ for the restricted constrained first layer model considered in [21]. Due to
this, it is of interest to obtain the Lipschitz constant with the factor $\frac{1}{1-L}$, reminiscent of
the upper bound on the row sums of the Dobrushin matrix $D$. Thus for $L < 1$ we can
also have the constant $\tilde L_1$ given by

$$\tilde L_1 = \frac{1}{1-L}, \quad\text{with}\quad L_1 \le \tilde L_1. \tag{68}$$

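Proof of Proposition 4.2: Let $\nu'_1,\nu'_2 \in \mathcal{P}(S')$; then the assertion of the proposition
follows by showing that

$$\|\gamma_{\nu'_1}(\cdot|\bar\nu^*_1) - \gamma_{\nu'_2}(\cdot|\bar\nu^*_2)\| \le L_1\,\|\nu'_1 - \nu'_2\|, \tag{69}$$

since if $\bar\nu^*_i$ is the unique consistent probability measure for $\gamma_{\nu'_i}$, then $\bar\nu^*_i = \gamma_{\nu'_i}(\cdot|\bar\nu^*_i)$ for
$i = 1,2$. Observe for any measurable function $f$ on $S \times S'$ with $|f| \le 1$ that

$$\left|\int(\nu'_1 - \nu'_2)(d\eta_i)\,\alpha(d\sigma_i)\,f(\sigma_i,\eta_i)\left[h_{\bar\nu^*_1}(\sigma_i,\eta_i) - h_{\bar\nu^*_2}(\sigma_i,\eta_i)\right]\right| \le \left|\int(\nu'_1 - \nu'_2)^+(d\eta_i)\,\alpha(d\sigma_i)\,f(\sigma_i,\eta_i)\left(h_{\bar\nu^*_1}(\sigma_i,\eta_i) - h_{\bar\nu^*_2}(\sigma_i,\eta_i)\right)\right| + \left|\int(\nu'_1 - \nu'_2)^-(d\eta_i)\,\alpha(d\sigma_i)\,f(\sigma_i,\eta_i)\left(h_{\bar\nu^*_1}(\sigma_i,\eta_i) - h_{\bar\nu^*_2}(\sigma_i,\eta_i)\right)\right|, \tag{70}$$

where $(\nu'_1 - \nu'_2)^+$ and $(\nu'_1 - \nu'_2)^-$ are respectively the positive and the negative parts
of the Jordan decomposition of the signed measure $\nu'_1 - \nu'_2$, and $h_{\bar\nu}$ is as given in (59). It
follows from the definition of the variational distance between two probability measures
that

$$\gamma_{\nu'_1}(f|\bar\nu^*_1) - \gamma_{\nu'_2}(f|\bar\nu^*_2) \le 2\,\|\nu'_1 - \nu'_2\|\sup_{\eta_i\in S'}\int\alpha(d\sigma_i)\left|h_{\bar\nu^*_1}(\sigma_i,\eta_i) - h_{\bar\nu^*_2}(\sigma_i,\eta_i)\right|, \tag{71}$$

since we have chosen $f$ to be such that $|f| \le 1$. Now we proceed by setting
$\bar\nu^*_s = s\bar\nu^*_1 + (1-s)\bar\nu^*_2$ with $0 \le s \le 1$ and observing that

$$\int\alpha(d\sigma_i)\left|h_{\bar\nu^*_1}(\sigma_i,\eta_i) - h_{\bar\nu^*_2}(\sigma_i,\eta_i)\right| = \int\alpha(d\sigma_i)\left|\int_0^1 ds\,\frac{d}{ds}h_{\bar\nu^*_s}(\sigma_i,\eta_i)\right| \le \int_0^1 ds\int\alpha(d\sigma_i)\left|\frac{d}{ds}h_{\bar\nu^*_s}(\sigma_i,\eta_i)\right|. \tag{72}$$

But we know from the proof of Theorem 4.1 that

$$\int_0^1 ds\int\alpha(d\sigma_i)\left|\frac{d}{ds}h_{\bar\nu^*_s}(\sigma_i,\eta_i)\right| \le C(F,g)\,\|\bar\nu^*_1 - \bar\nu^*_2\|\inf_{\bar\sigma_i\in S}\left(\int\alpha(d\sigma_i)\,k(\sigma_i,\eta_i)\,d^2(\sigma_i,\bar\sigma_i)\right)^{\frac{1}{2}}. \tag{73}$$

Now replacing $\|\bar\nu^*_1 - \bar\nu^*_2\|$ by 2 and substituting the bound in (73) into the inequality
in (71) concludes the proof of the proposition.
Having disposed of the continuity estimates for the CFLPK, we now turn our attention
to the corresponding continuity estimates for the single-site kernel $\gamma'_1$ for the
transformed system in the regime where $L < 1$. In this regime the CFLPK has a
unique consistent probability measure, which provides a sufficient condition
for the function $J_{\cdot}$ to have a unique global minimizer, as is required by Theorem 3.11.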



4.2 Continuity estimates for $\gamma'_1$

A sufficient condition given in Theorem 3.11 for the existence and continuity of the
finite-volume kernels $\gamma'_n$ for the transformed system is the unicity of the global minimizer of
the function $J_{\nu'}$. This sufficient condition holds if the CFLPK has a unique consistent
probability measure. Our main result in this subsection is the following theorem concerning the
continuity estimate for $\gamma'_1$:

Theorem 4.3 Suppose that $L < 1$; then under the variational metric on $\mathcal{P}(S')$ the
single-site kernel $\gamma'_1(\cdot|\nu')$ is Lipschitz continuous w.r.t. $\nu'$ with Lipschitz constant $L_2$
given by

$$L_2 = L_1\bar L. \tag{74}$$

Remark: 1. The Lipschitz constant $L_2$ for the transformed system factorizes into
the product of the Lipschitz constants $L$ and $\bar L$, respectively for the CFLPK $\gamma_{\cdot}$ and the
initial kernel $\gamma_1$.

2. Observe from the remark below Proposition 4.2 that we can also have the Lipschitz
constant $\tilde L_2$ given by

$$\tilde L_2 = \tilde L_1\bar L = \frac{\bar L}{1-L}, \quad\text{with}\quad L_2 \le \tilde L_2. \tag{75}$$

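Proof of Theorem 4.3: As usual let us take $\bar\nu^*_1$ and $\bar\nu^*_2$ as the unique consistent
probability measures for the CFLPK corresponding to $\nu'_1$ and $\nu'_2$ respectively. Again set
$\bar\nu^*_s = s\bar\nu^*_1 + (1-s)\bar\nu^*_2$ for $0 \le s \le 1$. It follows from (45), after taking a measurable map
$f : S' \to \mathbb{R}$ with $|f| \le 1$ and setting

$$h_{\bar\nu^*_s}(\sigma_i,\eta_i) = \frac{k(\sigma_i,\eta_i)\,e^{-\Phi^{(1)}(\pi_1\bar\nu^*_s,\delta_{\sigma_i})}}{\int\alpha(d\tilde\sigma_i)\,e^{-\Phi^{(1)}(\pi_1\bar\nu^*_s,\delta_{\tilde\sigma_i})}}, \tag{76}$$

that

$$\gamma'_1(f|\nu'_1) - \gamma'_1(f|\nu'_2) = \int\alpha'(d\eta_i)\,f(\eta_i)\int\alpha(d\sigma_i)\int_0^1 ds\,\frac{d}{ds}h_{\bar\nu^*_s}(\sigma_i,\eta_i) \quad\text{with}\quad \frac{d}{ds}h_{\bar\nu^*_s}(\sigma_i,\eta_i) = -\sum_{j=1}^{l}\sum_{u=1}^{l}F_{ju}(m(\pi_1\bar\nu^*_s))\,\psi_u\,h_{\bar\nu^*_s}(\sigma_i,\eta_i)\left(g_j(\sigma_i) - \vartheta_s(g_j)\right), \tag{77}$$

where $\psi_u = \int\pi_1(\bar\nu^*_1 - \bar\nu^*_2)(d\sigma_i)\,g_u(\sigma_i)$ and where $\vartheta_s(g_j) = \int\alpha'(d\eta_i)\,\alpha(d\sigma_i)\,h_{\bar\nu^*_s}(\sigma_i,\eta_i)\,g_j(\sigma_i)$.
Therefore it follows from our previous considerations that

$$\left|\gamma'_1(f|\nu'_1) - \gamma'_1(f|\nu'_2)\right| \le \|\partial^2F\|_{\max,\infty}\,\delta(g)\,\|\bar\nu^*_1 - \bar\nu^*_2\|\sum_{j=1}^{l}\int_0^1 ds\int\!\!\int\alpha'(d\eta_i)\,\alpha(d\sigma_i)\,h_{\bar\nu^*_s}(\sigma_i,\eta_i)\left|g_j(\sigma_i) - \vartheta_s(g_j)\right|. \tag{78}$$

By adding and subtracting $g_j(\bar\sigma_i)$ from the term $g_j(\sigma_i) - \vartheta_s(g_j)$ and applying the triangle
inequality we obtain

$$\int\!\!\int\alpha'(d\eta_i)\,\alpha(d\sigma_i)\,h_{\bar\nu^*_s}(\sigma_i,\eta_i)\left|g_j(\sigma_i) - \vartheta_s(g_j)\right| \le 2\int\!\!\int\alpha'(d\eta_i)\,\alpha(d\sigma_i)\,h_{\bar\nu^*_s}(\sigma_i,\eta_i)\left|g_j(\sigma_i) - g_j(\bar\sigma_i)\right|. \tag{79}$$

Now it follows from Hölder's inequality and the facts that (1) the square root function
is concave, (2) $h_{\bar\nu^*_s}(\sigma_i,\eta_i) \le e^{\delta(\Phi^{(1)})}k(\sigma_i,\eta_i)$ and (3) the Lipschitz property of $g_j$, that

$$\begin{aligned}
\sum_{j=1}^{l}\int_0^1 ds\int\!\!\int\alpha'(d\eta_i)\,\alpha(d\sigma_i)\,h_{\bar\nu^*_s}(\sigma_i,\eta_i)\left|g_j(\sigma_i) - \vartheta_s(g_j)\right|
&\le 2\int_0^1 ds\,\inf_{\bar\sigma_i\in S}\sum_{j=1}^{l}\left(\int\!\!\int\alpha'(d\eta_i)\,\alpha(d\sigma_i)\,h_{\bar\nu^*_s}(\sigma_i,\eta_i)\left(g_j(\sigma_i) - g_j(\bar\sigma_i)\right)^2\right)^{\frac{1}{2}} \\
&\le 2\int_0^1 ds\,\inf_{\bar\sigma_i\in S}\left(\int\!\!\int\alpha'(d\eta_i)\,\alpha(d\sigma_i)\,h_{\bar\nu^*_s}(\sigma_i,\eta_i)\sum_{j=1}^{l}\left(g_j(\sigma_i) - g_j(\bar\sigma_i)\right)^2\right)^{\frac{1}{2}} \\
&\le 2\,\|g\|_{d,2}\,e^{\frac{\delta(\Phi^{(1)})}{2}}\inf_{\bar\sigma_i\in S}\left(\int\alpha(d\sigma_i)\,d^2(\sigma_i,\bar\sigma_i)\right)^{\frac{1}{2}}.
\end{aligned} \tag{80}$$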

Finally, putting the bound in (80) into (78) and noting from Proposition 4.2 that
$\|\bar\nu^*_1 - \bar\nu^*_2\| \le L_1\|\nu'_1 - \nu'_2\|$, the proof follows.

5 Examples

We now present two examples for the class of models discussed in the above section.
In the first example we consider specific forms of the functions $F$ and $g$, and a specific
form of the joint a priori measure $K$. The second example is about general forms of $F$
and $g$, and a specific form of the joint a priori measure $K$.

5.1 Short time Gibbsianness of rotator mean-field models under diffusive time evolution

The first example we consider is the Curie-Weiss rotator model under site-wise
independent diffusive time-evolution. Here the single-site spin spaces for both the initial and
the transformed systems are the same, i.e. $S = S' = S^{q-1}$, where $S^{q-1}$ is the sphere in
the $q$-dimensional Euclidean space with $q \ge 2$. The interaction for the initial system is
given by

$$\Phi(\nu) = F(\nu[\sigma^1],\ldots,\nu[\sigma^q]) = -\frac{\beta\sum_{j=1}^{q}\nu[\sigma^j]^2}{2}, \tag{81}$$

where $g_j(\sigma_i) = \sigma_i^j$ is the $j$th coordinate of the point $\sigma_i \in S^{q-1}$ and $l = q$.
Next let $K$ be given by

$$K(d\sigma_i,d\eta_i) = K_t(d\sigma_i,d\eta_i) = k_t(\sigma_i,\eta_i)\,\alpha_0(d\sigma_i)\,\alpha_0(d\eta_i), \tag{82}$$

where $\alpha_0$ is the equidistribution on $S^{q-1}$ and $k_t$ is the heat kernel on the sphere, i.e.

$$\left(e^{\Delta t}\varphi\right)(\eta_i) = \int\alpha_0(d\sigma_i)\,k_t(\sigma_i,\eta_i)\,\varphi(\sigma_i), \tag{83}$$

where $\Delta$ is the Laplace-Beltrami operator on the sphere and $\varphi$ is any test function. $k_t$
is also called the Gauss-Weierstrass kernel. For more background on the heat kernel on
Riemannian manifolds, see the introduction of [1].

Lemma 5.1 Assuming the above set-up, the Lipschitz constant $L$ for the CFLPK is
given by

$$L = L(q,\beta,t) = 4\sqrt{2}\,q\beta e^{\beta}\left(1 - e^{-(q-1)t}\right)^{\frac{1}{2}}. \tag{84}$$

Proof of Lemma 5.1: The proof follows by just evaluating the terms appearing in
$C(F,g)$ and $\rho_\alpha(k)$. First observe that for the interaction considered in this
subsection $\|\partial^2F\|_{\max,\infty} = \beta$, $\delta(g) = 2q$, $\|g\|_{d,2} = 1$ since $d(\sigma_i,\bar\sigma_i) = \|g(\sigma_i) - g(\bar\sigma_i)\|_2$, and
$\delta(\Phi^{(1)}) = 2\beta$. So these give rise to

$$C(F,g) = C(q,\beta) = 4q\beta e^{\beta}. \tag{85}$$

To obtain $\rho_\alpha(k)$ we argue as follows: Denote by $u(t)$ the $q$-th component of a diffusion
$\sigma_i(t)$ on the sphere started at $u(0) = 1$ (in the "north pole") and by $\mathbb{E}$ the expectation
w.r.t. the corresponding diffusion. By reversibility we now choose $\bar\sigma_i = \eta_i$ such that
$\bar\sigma_i$ is the north pole. Then it is not hard to see that

$$d^2(\sigma_i(t),\bar\sigma_i) = 2(1 - u(t)).$$

This gives rise to

$$\int\alpha_0(d\sigma_i)\,k_t(\sigma_i,\bar\sigma_i)\,d^2(\sigma_i,\bar\sigma_i) = 2(1 - \mathbb{E}u(t)). \tag{86}$$

It follows from the above and the rotation invariance of the diffusion on the sphere that
to compute $\rho_\alpha(k)$ we only need the $q$th component of the diffusion, which according to
the Laplace-Beltrami operator on the sphere has a generator of the form

$$(1 - u^2)\left(\frac{d}{du}\right)^2 - (q-1)\,u\,\frac{d}{du}. \tag{87}$$

This generates the equation $\frac{d}{dt}\mathbb{E}u(t) = -(q-1)\,\mathbb{E}u(t)$. Solving this equation with the initial
condition $u(t=0) = 1$ yields

$$\mathbb{E}u(t) = e^{-(q-1)t}, \tag{88}$$

which leads to the desired expression for $\rho_\alpha(k)$, i.e.

$$\rho_\alpha(k) = \sqrt{2}\left(1 - e^{-(q-1)t}\right)^{\frac{1}{2}} \quad\text{and therefore}\quad L = 4\sqrt{2}\,q\beta e^{\beta}\left(1 - e^{-(q-1)t}\right)^{\frac{1}{2}}. \tag{89}$$

Remark: 1. The smallness of $L$ for this example emanates from at least two sources,
namely small values of $t$ and small values of $\beta$. That is, if $\beta$ is small enough the system will
be Gibbs at all times. But if we start with large $\beta$, then we can expect to preserve
Gibbsianness only at small values of $t$.

2. We also have for any arbitrarily chosen $\bar\sigma_i$ that

$$\int d^2(\sigma_i,\bar\sigma_i)\,\alpha(d\sigma_i) = \int 2(1 - \sigma\cdot\bar\sigma)\,\alpha_0(d\sigma) = 2. \tag{90}$$

Thus we have for this example $\rho_\alpha = \sqrt{2}$.
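The dependence of $L(q,\beta,t)$ on its parameters can be explored numerically. The following Python sketch evaluates (84) and inverts it to find the time up to which the sufficient condition $L < 1$ holds. The function names and sample parameter values are our own; the threshold concerns only the sufficient criterion of this paper, so the sketch says nothing about what happens at later times.

```python
import math

def lipschitz_constant(q, beta, t):
    """L(q, beta, t) = 4*sqrt(2)*q*beta*e^beta*(1 - e^{-(q-1)t})^{1/2}, as in (84)."""
    return (4.0 * math.sqrt(2.0) * q * beta * math.exp(beta)
            * math.sqrt(1.0 - math.exp(-(q - 1) * t)))

def criterion_time_threshold(q, beta):
    """Supremum of times t with L(q, beta, t) < 1.
    Returns math.inf when the t -> infinity limit of L is at most 1."""
    c = 4.0 * math.sqrt(2.0) * q * beta * math.exp(beta)   # limit of L as t -> infinity
    if c <= 1.0:
        return math.inf
    # Solve c^2 * (1 - exp(-(q-1) t)) = 1 for t.
    return -math.log(1.0 - 1.0 / c ** 2) / (q - 1)

if __name__ == "__main__":
    for beta in (0.01, 0.1, 1.0):
        print(beta, criterion_time_threshold(q=2, beta=beta))
```

For small $\beta$ the function returns infinity, matching the observation that weakly interacting systems satisfy the criterion at all times, while for larger $\beta$ only a short initial time window is covered.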

5.2 Local approximation and preservation of Gibbsianness 

As our second example we start from an initial compact Polish space $S$ endowed with
a metric $d$ and an a priori measure $\alpha$. We consider general $F$ and $g$ defining the initial
Hamiltonian.

We partition the initial space $S$ into finitely or countably infinitely many disjoint
Borel sets with non-zero $\alpha$ measure indexed by the elements in $S'$, i.e.

$$S = \bigcup_{\eta_i\in S'}S_{\eta_i}, \quad\text{with}\quad \alpha(S_{\eta_i}) > 0 \text{ for all } \eta_i \in S'. \tag{91}$$

We then consider the deterministic map $T : S \to S'$, such that $T(\sigma_i) = \eta_i$ for all
$\sigma_i \in S_{\eta_i}$. That is, every point is mapped to the label of the class it belongs to. If we
start with a finite initial space, this transformation is the so-called fuzzy map which,
when starting from an initial Potts model, was studied in [20]. In the present generality
this example was studied in [23], and we want to see here what the mean-field estimates
of the present paper provide. Let us formulate the form of the Lipschitz constant $L$ for
the CFLPK resulting from the local approximations in the following lemma.

Lemma 5.2 Assume the set-up above; then the Lipschitz constant $L$ is given by

$$L = L(F,g,T) = C(F,g)\sup_{\eta_i\in S'}\alpha(S_{\eta_i})^{-\frac{1}{2}}\inf_{\bar\sigma_i\in S}\left(\int_{S_{\eta_i}}\alpha(d\sigma_i)\,d^2(\sigma_i,\bar\sigma_i)\right)^{\frac{1}{2}}. \tag{92}$$

The proof of the above lemma follows straight away from the definition of the constant
$L$ and observing that $\alpha(d\sigma_i)\,k(\sigma_i,\eta_i) = \frac{1_{S_{\eta_i}}(\sigma_i)\,\alpha(d\sigma_i)}{\alpha(S_{\eta_i})}$.

Once again, the constant $L$ will be small either if the initial interaction is weak
enough or the local approximation is fine enough. For $L < 1$, by the general Theorem
4.3 this implies Gibbsianness and continuity estimates of the form (74).
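To see how the coarse-graining factor in (92) shrinks as the partition is refined, consider the following Python sketch. As an illustrative assumption (not taken from the text) we take $S = [0,1]$ with the Euclidean metric and uniform a priori measure, partitioned into intervals. For each cell the normalized restriction of $\alpha$ is the uniform law on that cell, and the infimum over $\bar\sigma_i$ of the mean squared distance is attained at the cell mean, so the factor is the largest standard deviation of a cell. The function name and the midpoint quadrature are our own choices.

```python
import math

def rho_partition(edges, samples=2000):
    """Coarse-graining factor sup_eta inf_sigma ( E[ d^2(sigma_i, sigma) | cell eta ] )^{1/2}
    for S = [0, 1], uniform alpha, and cells [edges[i], edges[i+1])."""
    worst = 0.0
    for a, b in zip(edges[:-1], edges[1:]):
        # midpoint quadrature nodes for the uniform law on the cell [a, b)
        xs = [a + (b - a) * (i + 0.5) / samples for i in range(samples)]
        mean = sum(xs) / samples                      # minimiser of the mean squared distance
        var = sum((x - mean) ** 2 for x in xs) / samples
        worst = max(worst, math.sqrt(var))
    return worst

if __name__ == "__main__":
    for n in (1, 2, 4, 8):
        edges = [i / n for i in range(n + 1)]
        print(n, rho_partition(edges))
```

For $n$ equal cells the exact value is $1/(n\sqrt{12})$, so doubling the number of cells halves the factor and hence the constant $L$ for fixed $C(F,g)$.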

6 Appendix 

6.1 Proof of Theorem ISTSl 

The assertion that $J_{\nu'}$ attains its infimum on $\mathcal{M}_{\nu'}$ trivially follows from the lower
semi-continuity of $J_{\nu'}$ and compactness of $\mathcal{M}_{\nu'}$. Now to proceed with the rest of
the proof we take any minimizer $\bar\nu^*$ of $J_{\nu'}$ which has the representation $\bar\nu^*(d\xi_i) =
\nu'(d\eta_i)\,\alpha(d\sigma_i)\,f_{\nu'}(\sigma_i|\eta_i)$. Then it remains to show that $f_{\nu'}$ (1) is $\alpha\otimes\nu'$-a.s. strictly
positive and (2) takes the form (20) $\alpha\otimes\nu'$-a.s.

(1) We now proceed to show the almost sure strict positivity of the minimizing
conditional $\alpha$-density $f_{\nu'}$. For each $\eta_i \in S'$ we set $A_{\eta_i} = \{\sigma_i \in S : f_{\nu'}(\sigma_i|\eta_i) = 0\}$ and
denote by $B$ the set of $\eta_i$'s for which $\alpha(A_{\eta_i}) > 0$. The proof consists of establishing a
contradiction, namely that $\bar\nu^*$ is not a minimizer of $J_{\nu'}$ whenever $B$ has positive $\nu'$ measure.
This is an adaptation of arguments found in [2] and references therein, modified to suit
our case.

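To be precise let $b : S \to \mathbb{R}$ be a strictly positive measurable map with $b \le 1$ and
for each $\eta_i \in B$ we define a bounded measurable map $g(\cdot|\eta_i) : S \to \mathbb{R}$ by

$$g(\sigma_i|\eta_i) = \frac{1_{A_{\eta_i}}(\sigma_i)\,b(\sigma_i) + f_{\nu'}(\sigma_i|\eta_i)}{u_{\eta_i}}. \tag{93}$$

Now set $u_{\eta_i} = 1 + \int_{A_{\eta_i}}b(\sigma_i)\,\alpha(d\sigma_i)$ and further define for each $\eta_i \in S'$

$$p_\varepsilon(\cdot,\eta_i) = \begin{cases}\varepsilon g(\cdot|\eta_i) + (1-\varepsilon)f_{\nu'}(\cdot|\eta_i) = \frac{\varepsilon}{u_{\eta_i}}1_{A_{\eta_i}}b + c_{\varepsilon,\eta_i}f_{\nu'}(\cdot|\eta_i), & \text{if } \eta_i \in B,\\[4pt] f_{\nu'}(\cdot|\eta_i), & \text{if } \eta_i \in B^c,\end{cases} \quad\text{where}\quad c_{\varepsilon,\eta_i} = \varepsilon\left(\frac{1}{u_{\eta_i}} - 1\right) + 1 \tag{94}$$

and $\varepsilon \in [0,1]$. It is easy to check that $1 \le u_{\eta_i} \le 2$, which implies that $c_{\varepsilon,\eta_i}$ as well as
$\log c_{\varepsilon,\eta_i}$ is uniformly bounded. Let us set $\bar\nu^*_\varepsilon(d\xi_i) = \nu'(d\eta_i)\,\alpha(d\sigma_i)\,p_\varepsilon(\sigma_i,\eta_i)$ and observe
from the above that we can write the relative entropy of $\bar\nu^*_\varepsilon$ w.r.t. $\alpha\otimes\nu'$ as

$$\begin{aligned}
S(\bar\nu^*_\varepsilon|\alpha\otimes\nu') = {}& \int_{B^c}\nu'(d\eta_i)\,S(\bar\nu^*(\cdot|\eta_i)|\alpha) + \int_B\nu'(d\eta_i)\Big\{\frac{\varepsilon}{u_{\eta_i}}\int_{A_{\eta_i}}\alpha(d\sigma_i)\,b(\sigma_i)\log b(\sigma_i) \\
& + \frac{\varepsilon}{u_{\eta_i}}\log\frac{\varepsilon}{u_{\eta_i}}\int_{A_{\eta_i}}\alpha(d\sigma_i)\,b(\sigma_i) + c_{\varepsilon,\eta_i}\int\alpha(d\sigma_i)\,f_{\nu'}(\sigma_i|\eta_i)\log f_{\nu'}(\sigma_i|\eta_i) \\
& + c_{\varepsilon,\eta_i}\log c_{\varepsilon,\eta_i}\int\alpha(d\sigma_i)\,f_{\nu'}(\sigma_i|\eta_i)\Big\}.
\end{aligned} \tag{95}$$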

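Now we define a function $h : [0,1] \to \mathbb{R}$ by

$$h(\varepsilon) = J_{\nu'}(\bar\nu^*_\varepsilon) = S(\bar\nu^*_\varepsilon|\alpha\otimes\nu') + \Phi(\pi_1\bar\nu^*_\varepsilon) - \bar\nu^*_\varepsilon[\log k]. \tag{96}$$

It follows from the uniform boundedness of $c_{\varepsilon,\eta_i}$ and $\log c_{\varepsilon,\eta_i}$, Lebesgue's dominated
convergence theorem and the continuity property of $\Phi^{(1)}$ that $h$ is continuously
differentiable on $(0,1)$. Observe that $h(0) = J_{\nu'}(\bar\nu^*)$, and one would expect $h$ to be decreasing
as $\varepsilon \downarrow 0$, i.e. $h(\varepsilon) - h(0) \ge 0$ for $\varepsilon$ close to zero. But we will show that the converse of
the above holds if $B$ has positive $\nu'$ measure. More precisely, differentiating $h$ we obtain
for $\varepsilon \in (0,1)$

$$\begin{aligned}
h'(\varepsilon) = {}& \int_B\nu'(d\eta_i)\Big\{\frac{1}{u_{\eta_i}}\log\frac{\varepsilon}{u_{\eta_i}}\int_{A_{\eta_i}}\alpha(d\sigma_i)\,b(\sigma_i) + \Big(\frac{1}{u_{\eta_i}} - 1\Big)\log c_{\varepsilon,\eta_i}\int\alpha(d\sigma_i)\,f_{\nu'}(\sigma_i|\eta_i)\Big\} \\
& + \int_B\nu'(d\eta_i)\int\alpha(d\sigma_i)\left(g(\sigma_i|\eta_i) - f_{\nu'}(\sigma_i|\eta_i)\right)\Phi^{(1)}(\pi_1\bar\nu^*_\varepsilon,\delta_{\sigma_i}) + C(B,b,f_{\nu'}),
\end{aligned} \tag{97}$$

where $C(B,b,f_{\nu'})$ is a constant which depends on $B$, $b$ and $f_{\nu'}$ but is independent of $\varepsilon$.
Assuming $\nu'(B) > 0$, the limit

$$\lim_{\varepsilon\downarrow 0}h'(\varepsilon) = -\infty,$$

since the term

$$\int_B\nu'(d\eta_i)\,\frac{1}{u_{\eta_i}}\log\frac{\varepsilon}{u_{\eta_i}}\int_{A_{\eta_i}}\alpha(d\sigma_i)\,b(\sigma_i) \tag{98}$$

goes to negative infinity while the rest remains bounded. This implies that

$$\lim_{\varepsilon\downarrow 0}\frac{h(\varepsilon) - h(0)}{\varepsilon} = -\infty, \tag{99}$$

giving rise to a contradiction since $\bar\nu^*$ is a minimizer; hence the assumption that
$\nu'(B) > 0$ is false. This concludes the proof of almost sure strict positivity of $f_{\nu'}$.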

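(2) Next we prove that the conditional $\alpha$-density of any minimizer of $J_{\nu'}$ must satisfy
the constrained mean-field equation (20). Let $f_{\nu'}(\cdot|\cdot)$ be a conditional $\alpha$-density of
a minimizer $\nu^*$ of $J_{\nu'}$ and set

$$
\begin{aligned}
Y_{\eta_i}(\sigma_i) = {} & \Phi^{(1)}(\pi_1\nu^*,\delta_{\sigma_i}) - \log k(\sigma_i,\eta_i) + \log f_{\nu'}(\sigma_i|\eta_i) \\
& + \log\int\exp\big(-\Phi^{(1)}(\pi_1\nu^*,\delta_{\bar\sigma_i})\big)\,k(\bar\sigma_i,\eta_i)\,\alpha(d\bar\sigma_i).
\end{aligned}
\tag{100}
$$

We are now left to show that $\alpha(Y_{\eta_i} = 0) = 1$ for $\nu'$-almost all $\eta_i \in S'$.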
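
To see why $\alpha(Y_{\eta_i}=0)=1$ is the relevant statement, one can exponentiate (100). The display below is a sketch of this rewriting; that the resulting identity coincides with the constrained mean-field equation (20) is an assumption here, since (20) is not restated in this part of the text. The bar on the integration variable is a notational choice made for the sketch.

```latex
% Sketch: Y = 0 rewritten as a fixed-point equation for the density.
% Assumption: f_{\nu'}(\sigma_i|\eta_i) > 0, so that \log f_{\nu'} is finite.
\[
  Y_{\eta_i}(\sigma_i)=0
  \quad\Longleftrightarrow\quad
  f_{\nu'}(\sigma_i|\eta_i)
  = \frac{\exp\!\big(-\Phi^{(1)}(\pi_1\nu^*,\delta_{\sigma_i})\big)\,k(\sigma_i,\eta_i)}
         {\int \exp\!\big(-\Phi^{(1)}(\pi_1\nu^*,\delta_{\bar\sigma_i})\big)\,
          k(\bar\sigma_i,\eta_i)\,\alpha(d\bar\sigma_i)} .
\]
% More generally, Y is the log-ratio of the two sides:
\[
  Y_{\eta_i}(\sigma_i)
  = \log\frac{f_{\nu'}(\sigma_i|\eta_i)}
             {\exp\!\big(-\Phi^{(1)}(\pi_1\nu^*,\delta_{\sigma_i})\big)\,k(\sigma_i,\eta_i)
              \Big/ \int \exp\!\big(-\Phi^{(1)}(\pi_1\nu^*,\delta_{\bar\sigma_i})\big)\,
              k(\bar\sigma_i,\eta_i)\,\alpha(d\bar\sigma_i)} ,
\]
% so Y > 0 (resp. Y < 0) marks the points where f_{\nu'} lies above
% (resp. below) the right-hand side of the fixed-point equation.
```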

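The idea of the proof is again to assume the contrary, and arrive at the contradiction
that a suitable perturbation of the conditional $\alpha$-density $f_{\nu'}$ would have a lower value of
$J_{\nu'}$. That is, we assume $\alpha(Y_{\eta_i} \neq 0) > 0$ on some subset $B$ of $S'$ with positive $\nu'$-measure.
In a first step this implies that for any $\eta_i \in B$ both inequalities $\alpha(Y_{\eta_i} > 0) > 0$ and
$\alpha(Y_{\eta_i} < 0) > 0$ must hold. Indeed, the assumption that e.g. the second inequality
is not true leads to a contradiction. To see this define for each $\eta_i \in B$ and $\delta > 0$

$$
A^{\delta}_{\eta_i} = \{\sigma_i \in S : \psi_{\nu'}(\sigma_i|\eta_i) > \delta\} \quad\text{with}\quad
\psi_{\nu'}(\sigma_i|\eta_i) = f_{\nu'}(\sigma_i|\eta_i) - \frac{\exp\big(-\Phi^{(1)}(\pi_1\nu^*,\delta_{\sigma_i})\big)\,k(\sigma_i,\eta_i)}{\int\exp\big(-\Phi^{(1)}(\pi_1\nu^*,\delta_{\bar\sigma_i})\big)\,k(\bar\sigma_i,\eta_i)\,\alpha(d\bar\sigma_i)},
\tag{101}
$$

so that we would have $\lim_{\delta\downarrow 0}\alpha(A^{\delta}_{\eta_i}) = 1$.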

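Taking the $\alpha$-integral of $\psi_{\nu'}(\sigma_i|\eta_i)$ then yields the contradiction

$$\int\alpha(d\sigma_i)\,\psi_{\nu'}(\sigma_i|\eta_i) \ge \int_{A^{\delta}_{\eta_i}}\alpha(d\sigma_i)\,\psi_{\nu'}(\sigma_i|\eta_i) > \delta\,\alpha(A^{\delta}_{\eta_i}) > 0 \tag{102}$$

for $\delta$ sufficiently small. This and the $\alpha\otimes\nu'$-a.s. strict positivity of the minimizing
conditional $\alpha$-density imply that $\alpha(1_{\{Y_{\eta_i}<0\}}f_{\nu'}(\cdot|\eta_i)) > 0$ and $\alpha(1_{\{Y_{\eta_i}>0\}}f_{\nu'}(\cdot|\eta_i)) > 0$.
Now we set for each $\eta_i \in S'$

$$C_{\eta_i} = \frac{\alpha(1_{\{Y_{\eta_i}>0\}}f_{\nu'}(\cdot|\eta_i))}{\alpha(1_{\{Y_{\eta_i}<0\}}f_{\nu'}(\cdot|\eta_i))} \tag{103}$$

and define for each positive integer $n \in \mathbb{N}$ the set

$$B_n := \{\eta_i : C_{\eta_i} \in (n-1, n]\}. \tag{104}$$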

Observe that $(B_n)_{n\ge 1}$ is a partition of the set $B \subset S'$, i.e. $B = \bigcup_{n\ge 1} B_n$ and the
$B_n$'s are pairwise disjoint Borel subsets of $S'$. We now consider a perturbation of the
minimizing conditional $\alpha$-density $f_{\nu'}(\cdot|\eta_i)$ whose form will depend on the choice
of the transformed configuration $\eta_i$. More precisely, for each $\varepsilon \in [0,1]$ we consider the
perturbed conditional $\alpha$-density $\rho_\varepsilon(\sigma_i|\eta_i)$ of the form

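$$
\begin{aligned}
\rho_\varepsilon(\cdot|\eta_i) &= 1_{B^c}(\eta_i)\,f_{\nu'}(\cdot|\eta_i) + \sum_{n=1}^{\infty} 1_{B_n}(\eta_i)\,f_{\varepsilon,n}(\cdot|\eta_i), \quad\text{where} \\
f_{\varepsilon,n}(\cdot|\eta_i) &= \Big(1 - \frac{\varepsilon}{3n}\Big)1_{\{Y_{\eta_i}>0\}}f_{\nu'}(\cdot|\eta_i) + 1_{\{Y_{\eta_i}=0\}}f_{\nu'}(\cdot|\eta_i) + \Big(1 + \frac{\varepsilon C_{\eta_i}}{3n}\Big)1_{\{Y_{\eta_i}<0\}}f_{\nu'}(\cdot|\eta_i).
\end{aligned}
\tag{105}
$$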

Observe that for each $\eta_i \in S'$, $\rho_\varepsilon(\cdot|\eta_i)$ is a probability density w.r.t. $\alpha$ and $\rho_{\varepsilon=0} = f_{\nu'}$.
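
The normalization claim can be checked directly. The sketch below assumes a perturbation that multiplies $f_{\nu'}$ by $(1-a\varepsilon)$ on $\{Y_{\eta_i}>0\}$ and by $(1+a\varepsilon C_{\eta_i})$ on $\{Y_{\eta_i}<0\}$, leaving it unchanged on $\{Y_{\eta_i}=0\}$; the constant $a>0$ is a hypothetical placeholder for the $n$-dependent factor in (105), and $C_{\eta_i}$ is taken to be the ratio of the two masses as in (103).

```latex
% Sketch: mass conservation of the perturbed density on B_n.
% Assumptions: a > 0 is a placeholder constant with a*eps <= 1, and
%   C_{\eta_i} = \alpha(1_{\{Y>0\}} f_{\nu'}) / \alpha(1_{\{Y<0\}} f_{\nu'}).
\begin{align*}
  \int \alpha(d\sigma_i)\, f_{\varepsilon,n}(\sigma_i|\eta_i)
  &= \int \alpha(d\sigma_i)\, f_{\nu'}(\sigma_i|\eta_i)
     - a\varepsilon\,\alpha\big(1_{\{Y_{\eta_i}>0\}} f_{\nu'}(\cdot|\eta_i)\big)
     + a\varepsilon\, C_{\eta_i}\,
       \alpha\big(1_{\{Y_{\eta_i}<0\}} f_{\nu'}(\cdot|\eta_i)\big) \\
  &= 1 - a\varepsilon\Big[
       \alpha\big(1_{\{Y_{\eta_i}>0\}} f_{\nu'}(\cdot|\eta_i)\big)
       - C_{\eta_i}\,\alpha\big(1_{\{Y_{\eta_i}<0\}} f_{\nu'}(\cdot|\eta_i)\big)
     \Big]
   = 1 .
\end{align*}
% The bracket vanishes by the definition of C_{\eta_i}: the mass removed
% where Y > 0 is exactly the mass added where Y < 0.
% Nonnegativity holds because 1 - a*eps >= 0 and 1 + a*eps*C >= 1.
```

The role of the sets $B_n$ in this picture is to keep the factor multiplying $1_{\{Y_{\eta_i}<0\}}f_{\nu'}$ uniformly bounded, since $C_{\eta_i} \le n$ on $B_n$.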
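As we did in the proof of part one we introduce a function $\varphi : [0,1] \to \mathbb{R}$ and show
that $\varphi$ is decreasing for arguments very close to zero whenever $\nu'(B) > 0$. But this
would then imply that $f_{\nu'}$ is not a conditional $\alpha$-density for a minimizer of $J_{\nu'}$. Hence
we must have $\nu'(B) = 0$, which will then conclude our proof. To formulate
this formally we set $\nu^*_\varepsilon(d\xi_i) = \nu'(d\eta_i)\alpha(d\sigma_i)\rho_\varepsilon(\sigma_i|\eta_i)$ and define

$$\varphi(\varepsilon) := J_{\nu'}(\nu^*_\varepsilon) = S(\nu^*_\varepsilon|\alpha\otimes\nu') + \Phi(\pi_1\nu^*_\varepsilon) - \nu^*_\varepsilon[\log k].$$

Now note that the relative entropy takes the form

$$S(\nu^*_\varepsilon|\alpha\otimes\nu') = \int_{B^c}\nu'(d\eta_i)\,S(\nu^*(\cdot|\eta_i)|\alpha) + \sum_{n=1}^{\infty}\int_{B_n}\nu'(d\eta_i)\,S(\nu^*_\varepsilon(\cdot|\eta_i)|\alpha). \tag{106}$$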

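Note that this is a bounded quantity for each $\varepsilon \in [0,1]$, since in particular

$$S(\nu^*_\varepsilon|\alpha\otimes\nu') \le 4\,S(\nu^*|\alpha\otimes\nu') + 2\log 2.$$

Then Lebesgue's bounded convergence theorem and the properties of the
interaction $\Phi$ imply that $\varphi$ is differentiable in the open interval $(0,1)$. However, by the
convexity of $S$ and once again the properties of $\Phi$ one can deduce that

$$\varphi'_+(0) = \lim_{\varepsilon\downarrow 0}\frac{\varphi(\varepsilon) - \varphi(0)}{\varepsilon} = \frac{d}{d\varepsilon}J_{\nu'}(\nu^*_\varepsilon)\Big|_{\varepsilon=0}.$$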

Thus the rest of the proof consists in showing that $\varphi'_+(0)$ is negative whenever
$\nu'(B) > 0$. Let us now evaluate this quantity. Note that because of the form of
the perturbed conditional $\alpha$-density the part of $S(\nu^*_\varepsilon|\alpha\otimes\nu')$ that will play a role in
determining $\varphi'_+(0)$ is the part involving the $B_n$'s. For each $n$ we set



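$$S_n(\varepsilon) = \int_{B_n}\nu'(d\eta_i)\,S(\nu^*_\varepsilon(\cdot|\eta_i)|\alpha) \tag{107}$$

and evaluate

$$
\frac{d}{d\varepsilon}S_n(\varepsilon)\Big|_{\varepsilon=0} = \frac{1}{3n}\Big\{\int_{B_n}\nu'(d\eta_i)\,C_{\eta_i}\int_{\{Y_{\eta_i}<0\}}\alpha(d\sigma_i)\,f_{\nu'}(\sigma_i|\eta_i)\log f_{\nu'}(\sigma_i|\eta_i) - \int_{B_n}\nu'(d\eta_i)\int_{\{Y_{\eta_i}>0\}}\alpha(d\sigma_i)\,f_{\nu'}(\sigma_i|\eta_i)\log f_{\nu'}(\sigma_i|\eta_i)\Big\}.
\tag{108}
$$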
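
The derivative of the entropy term can be traced through a short computation. The sketch below again writes the perturbation with a generic constant $a>0$ (a hypothetical placeholder for the $n$-dependent factor of (105)), fixes $\eta_i \in B_n$, abbreviates $f = f_{\nu'}(\cdot|\eta_i)$ and $\rho_\varepsilon = \rho_\varepsilon(\cdot|\eta_i)$, and assumes that differentiation under the $\alpha$-integral is justified.

```latex
% Sketch: derivative at eps = 0 of the entropy of the perturbed density.
% Assumptions: rho_eps = (1 - a eps) 1_{Y>0} f + 1_{Y=0} f
%                        + (1 + a eps C) 1_{Y<0} f,  with a > 0 a placeholder,
%              and C = alpha(1_{Y>0} f) / alpha(1_{Y<0} f).
\begin{align*}
  \frac{d}{d\varepsilon}\int \alpha(d\sigma_i)\,\rho_\varepsilon\log\rho_\varepsilon
  &= \int \alpha(d\sigma_i)\,(\partial_\varepsilon\rho_\varepsilon)\,
     \big(\log\rho_\varepsilon + 1\big),
  \qquad
  \partial_\varepsilon\rho_\varepsilon
  = a\big(C\,1_{\{Y<0\}} - 1_{\{Y>0\}}\big) f . \\
\intertext{Setting $\varepsilon=0$, where $\rho_0=f$, gives}
  \frac{d}{d\varepsilon}\int \alpha(d\sigma_i)\,
     \rho_\varepsilon\log\rho_\varepsilon\Big|_{\varepsilon=0}
  &= a\Big\{ C\int_{\{Y<0\}}\alpha(d\sigma_i)\, f\log f
            - \int_{\{Y>0\}}\alpha(d\sigma_i)\, f\log f \Big\}
     + a\Big\{ C\,\alpha\big(1_{\{Y<0\}}f\big)
              - \alpha\big(1_{\{Y>0\}}f\big) \Big\} .
\end{align*}
% The last brace is zero by the definition of C, so only the
% f log f terms survive; integrating over B_n against nu' gives the
% structure of the right-hand side of (108).
```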



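Next we also evaluate

$$
\begin{aligned}
\frac{d}{d\varepsilon}\Big(\Phi(\pi_1\nu^*_\varepsilon) - \nu^*_\varepsilon[\log k]\Big)\Big|_{\varepsilon=0} = \sum_{n=1}^{\infty}\frac{1}{3n}\Big\{ & -\int_{B_n}\nu'(d\eta_i)\int_{\{Y_{\eta_i}>0\}}\alpha(d\sigma_i)\,f_{\nu'}(\sigma_i|\eta_i)\big(\Phi^{(1)}(\pi_1\nu^*,\delta_{\sigma_i}) - \log k(\sigma_i,\eta_i)\big) \\
& + \int_{B_n}\nu'(d\eta_i)\,C_{\eta_i}\int_{\{Y_{\eta_i}<0\}}\alpha(d\sigma_i)\,f_{\nu'}(\sigma_i|\eta_i)\big(\Phi^{(1)}(\pi_1\nu^*,\delta_{\sigma_i}) - \log k(\sigma_i,\eta_i)\big)\Big\}.
\end{aligned}
\tag{109}
$$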

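It follows from the above considerations that

$$
\begin{aligned}
\frac{d}{d\varepsilon}\Big(S(\nu^*_\varepsilon|\alpha\otimes\nu') + \Phi(\pi_1\nu^*_\varepsilon) - \nu^*_\varepsilon[\log k]\Big)\Big|_{\varepsilon=0}
= \sum_{n=1}^{\infty}\frac{1}{3n}\Big\{ & -\int_{B_n}\nu'(d\eta_i)\int_{\{Y_{\eta_i}>0\}}\alpha(d\sigma_i)\,f_{\nu'}(\sigma_i|\eta_i)\big(\log f_{\nu'}(\sigma_i|\eta_i) + \Phi^{(1)}(\pi_1\nu^*,\delta_{\sigma_i}) - \log k(\sigma_i,\eta_i)\big) \\
& + \int_{B_n}\nu'(d\eta_i)\,C_{\eta_i}\int_{\{Y_{\eta_i}<0\}}\alpha(d\sigma_i)\,f_{\nu'}(\sigma_i|\eta_i)\big(\log f_{\nu'}(\sigma_i|\eta_i) + \Phi^{(1)}(\pi_1\nu^*,\delta_{\sigma_i}) - \log k(\sigma_i,\eta_i)\big)\Big\}.
\end{aligned}
\tag{110}
$$

Furthermore, set

$$r(\eta_i) = \log\int\alpha(d\sigma_i)\,\exp\big(-\Phi^{(1)}(\pi_1\nu^*,\delta_{\sigma_i})\big)\,k(\sigma_i,\eta_i).$$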



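Finally, it follows from (100) and (103), after adding and subtracting $r(\eta_i)$ from the
integrands of the $\alpha$-integrals in (110), that

$$
\begin{aligned}
\frac{d}{d\varepsilon}&\Big(S(\nu^*_\varepsilon|\alpha\otimes\nu') + \Phi(\pi_1\nu^*_\varepsilon) - \nu^*_\varepsilon[\log k]\Big)\Big|_{\varepsilon=0} \\
&= \sum_{n=1}^{\infty}\frac{1}{3n}\Big\{\int_{B_n}\nu'(d\eta_i)\,r(\eta_i)\int_{\{Y_{\eta_i}>0\}}\alpha(d\sigma_i)\,f_{\nu'}(\sigma_i|\eta_i) - \int_{B_n}\nu'(d\eta_i)\int_{\{Y_{\eta_i}>0\}}\alpha(d\sigma_i)\,f_{\nu'}(\sigma_i|\eta_i)\,Y_{\eta_i}(\sigma_i) \\
&\qquad\qquad - \int_{B_n}\nu'(d\eta_i)\,C_{\eta_i}\,r(\eta_i)\int_{\{Y_{\eta_i}<0\}}\alpha(d\sigma_i)\,f_{\nu'}(\sigma_i|\eta_i) + \int_{B_n}\nu'(d\eta_i)\,C_{\eta_i}\int_{\{Y_{\eta_i}<0\}}\alpha(d\sigma_i)\,f_{\nu'}(\sigma_i|\eta_i)\,Y_{\eta_i}(\sigma_i)\Big\} \\
&= \sum_{n=1}^{\infty}\frac{1}{3n}\Big\{-\int_{B_n}\nu'(d\eta_i)\int_{\{Y_{\eta_i}>0\}}\alpha(d\sigma_i)\,f_{\nu'}(\sigma_i|\eta_i)\,Y_{\eta_i}(\sigma_i) + \int_{B_n}\nu'(d\eta_i)\,C_{\eta_i}\int_{\{Y_{\eta_i}<0\}}\alpha(d\sigma_i)\,f_{\nu'}(\sigma_i|\eta_i)\,Y_{\eta_i}(\sigma_i)\Big\} \\
&< 0.
\end{aligned}
\tag{111}
$$

Therefore we have succeeded in showing that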



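$$\varphi'_+(0) = \lim_{\varepsilon\downarrow 0}\frac{\varphi(\varepsilon) - \varphi(0)}{\varepsilon} = \lim_{\varepsilon\downarrow 0}\frac{J_{\nu'}(\nu^*_\varepsilon) - J_{\nu'}(\nu^*)}{\varepsilon} < 0$$

whenever $\nu'(B) > 0$, contradicting the initial claim that $\nu^*$ is a minimizer of $J_{\nu'}$. Hence
for $\nu^*$ to be a minimizer we must have $\nu'(B) = 0$. This concludes the proof of the claim that the
conditional $\alpha$-densities $f_{\nu'}$ of the minimizers of $J_{\nu'}$ satisfy the constrained mean-field equation
(20) $\alpha\otimes\nu'$-a.s.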
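
The strict negativity asserted in (111) rests on a sign argument that can be made explicit. The sketch below assumes that the right-hand side of (111) is a sum over $n$ of positive constants $a_n$ (hypothetical placeholders for the $n$-dependent factors) times integrals over $B_n$ of the bracket $T(\eta_i)$ defined in the block; the symbol $T$ is introduced here only for illustration.

```latex
% Sketch: why the final expression in (111) is strictly negative.
% Assumptions: f_{\nu'} > 0 alpha-a.s. (part (1) of the proof),
%   alpha(1_{Y>0} f_{\nu'}) > 0 and alpha(1_{Y<0} f_{\nu'}) > 0 on B,
%   hence 0 < C_{\eta_i} < infinity on B; a_n > 0 are placeholder constants.
For $\eta_i\in B$ put
\[
  T(\eta_i) \;=\;
  -\int_{\{Y_{\eta_i}>0\}}\alpha(d\sigma_i)\,
     f_{\nu'}(\sigma_i|\eta_i)\,Y_{\eta_i}(\sigma_i)
  \;+\; C_{\eta_i}\int_{\{Y_{\eta_i}<0\}}\alpha(d\sigma_i)\,
     f_{\nu'}(\sigma_i|\eta_i)\,Y_{\eta_i}(\sigma_i).
\]
On $\{Y_{\eta_i}>0\}$ the integrand $f_{\nu'}Y_{\eta_i}$ is strictly
positive on a set of positive $\alpha$-measure, so the first term is
strictly negative. On $\{Y_{\eta_i}<0\}$ the integrand is strictly
negative on a set of positive $\alpha$-measure and $C_{\eta_i}>0$, so the
second term is strictly negative as well. Hence $T(\eta_i)<0$ for every
$\eta_i\in B$. If $\nu'(B)>0$, then $\nu'(B_n)>0$ for at least one $n$,
because the $B_n$ partition $B$, and therefore
\[
  \sum_{n=1}^{\infty} a_n \int_{B_n}\nu'(d\eta_i)\,T(\eta_i) \;<\; 0 .
\]
```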
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